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A  REVIEW  AND  EVALUATION  OF  RELIABILITY  CONCEPTS  FOR  DESIGN 


OF  WATER  DISTRIBUTION  SYSTEMS 


PART  I:  INTRODUCTION 


Background 


1.  The  American  Water  Works  Association  (AWWA)  (1974)  defines  a  water 
distribution  system  as  "including  all  water  utility  components  for  the  dis¬ 
tribution  of  finished  or  potable  water  by  means  of  gravity  storage  feed  or 
pumps  through  distribution-pumping  networks  to  customers  or  other  users,  in¬ 
cluding  distribution-equalizing  storage."  The  design  or  extension  of  a  water 
distribution  system  generally  involves  large  capital  outlays  as  well  as  the 
continuing  operation,  maintenance,  and  repair  costs.  Because  of  the  complex¬ 
ity  of  the  problems  arising  from  the  large  number  of  design  components  and 
their  interaction,  automated  procedures  that  result  in  reliable,  but  minimum 
cost,  designs  are  desired.  Conventional  design  approaches  consist  of  select¬ 
ing  a  network  configuration,  pipe  sizes,  reservoir  sizes  and  elevations,  and 
pumping  facilities  using  a  trial  and  error  procedure  that  attempts  to  find  a 
workable  design  representing  a  low-cost  solution.  No  guarantee  can  be  made 
that  the  resulting  distribution  system  is  a  minimum  cost  solution  nor  is  any 
measure  made  of  the  reliability  of  the  designed  system. 

2.  Considerable  emphasis  has  recently  been  placed  on  the  problem  of  the 
state  of  decay  in  the  Nation's  infrastructure  because  of  its  importance  to  the 
needs  of  society  and  industrial  growth.  Large  capital  expenditures  will  be 
needed  to  bring  the  concerned  systems  to  higher  levels  of  serviceability  and 
to  create  a  vigor  in  the  US  industrial  competitiveness  in  the  world.  One  of 
the  most  vital  services  is  an  adequate  water  supply  system,  without  which 
industry  cannot  survive. 

3.  The  lack  of  adequate  water  supply  systems  is  due  to  the  deteriora¬ 
tion  of  aging  water  supply  systems  in  older  urbanized  areas  and/or  to  the 
inadequate  capacity  of  water  supply  systems  in  man;.'  areas  that  are  undergoing 
rapid  urbanization,  such  as  in  the  southwestern  United  States.  In  other 
words,  methods  for  the  evaluation  of  the  Nation's  water  supply  services  need 
to  consider  not  onlv  the  rehabilitation  of  existing  urban  water  supply 


systems,  but  also  the  future  development  of  new  water  supply  systems  to  serve 
expanding  population  centers.  Both  the  adaptation  of  existing  technologies 
and  the  development  of  new,  innovative  technologies  will  be  required  to 
improve  the  efficiency  and  cost-effectiveness  of  future  and  existing  water 
supply  systems  and  facilities  necessary  for  industrial  growth. 

4.  Traditionally,  investments  in  infrastructure  maintenance  have  been 
small  and  given  minor  attention.  However,  a  US  Environmental  Protection 
Agency  (EPA)  survey  (Clark,  Stafford,  and  Goodrich  1982)  of  previous  water 
supply  projects  concluded  that  the  distribution  facilities  in  water  supply 
systems  account  for  the  largest  cost  item  in  future  maintenance  budgets.  The 
aging,  deteriorating  distribution  systems  in  many  areas  raise  tremendous  main¬ 
tenance  decisionmaking  problems  which  are  further  complicated  by  the  expansion 
of  existing  systems.  Deterioration  of  the  water  distribution  systems  in  many 
areas  has  translated  into  a  high  proportion  of  unaccounted-for  water  due  to 
leaks,  which  not  only  is  a  loss  of  a  valuable  resource,  but  also  raises  con¬ 
cerns  about  unsafe  drinking  water  caused  by  possible  contamination  through 
cracked  pipes.  As  an  example,  the  following  percentages  of  water  lost  through 
leaks  have  been  reported  in  the  literature:  17  percent  in  Boston,  15  percent 
in  St.  Louis,  15  percent  in  Cleveland,  14  percent  in  Pittsburgh,  and  14  per¬ 
cent  in  Tulsa  (Choate  and  Walter  1981).  Many  have  estimated  the  capital  needs 
to  rehabilitate  urban  water  distribution  systems  in  the  United  States  on  the 
order  of  $75  to  $110  billion  (1972  dollars)  ranging  over  the  next  20  years 
(Choate  and  Walter  1981). 

5.  The  deferral  of  critical  maintenance  causes  the  reliability  of  ex¬ 
isting  systems  to  decrease.  Recently  some  municipalities  have  been  more  will¬ 
ing  or  able  to  finance  rehabilitation  of  deteriorating  pipelines.  Other 
municipalities,  however,  still  defer  needed  maintenance  and  replacement  of 
system  components  until  some  catastrophic  event  occurs.  Water  main  failures 
have  been  extensive  in  many  cities.  As  an  example,  in  1973  in  Houston,  Tex., 
there  were  5,149  breaks  in  the  3,998  miles  of  water  mains,  which  is  1,290 
breaks  per  year  per  1,000  miles  of  mains.*  In  New  Orleans,  there  was  an  aver¬ 
age  of  680  breaks  per  year  per  1,000  miles  of  mains  in  the  1,519  miles  of 
mains  during  the  period  1969-1978. 


6.  Water  utilities  are  implemented  to  construct,  operate,  and  maintain 
water  supply  systems.  The  basic  function  of  these  water  utilities  is  to 
obtain  water  from  a  source,  treat  the  water  to  an  acceptable  quality,  and 
deliver  the  desired  quantity  of  water  at  the  desired  time.  The  analysis  of  a 
water  supply  system  is  often  devoted  to  the  evaluation  of  one  or  more  of  the 
five  major  system  components:  source,  raw  water  transmission,  treatment  sys¬ 
tem,  storage  system,  or  distribution  system.  The  storage  system  consists  of 
both  raw  water  storage  and  finished  water  storage. 

7.  To  ensure  reliable  delivery  of  finished  water  to  the  user,  the  water 
distribution  system  must  be  designed  to  accommodate  a  range  of  expected  load¬ 
ing  conditions.  For  example,  water  system  components  may  be  designed  to  meet 
maximum  day,  maximum  hourly,  average  day,  average  hourly,  or  maximum  instan¬ 
taneous  demands.  Fire  flow  demand  may  also  be  superimposed  on  these  "normal" 
demand  events.  In  addition,  the  system  must  be  designed  to  accommodate  abnor¬ 
mal  conditions  such  as  broken  pipes  and  appurtenances;  mechanical  failure  of 
systems  such  as  pumps,  valves,  control  systems;  power  outages;  failure  of 
storage  facilities;  system  contamination;  and  inaccurate  demand  projections. 
Each  of  these  conditions  must  be  examined  with  an  emphasis  on  describing  its 
impact  on  the  system,  developing  relevant  measures  of  system  performance,  and 
designing  into  the  system  the  capacity  required  to  handle  emergency  conditions 
with  an  acceptable  measure  of  reliability. 

8.  A  review  of  the  literature  (Part  II)  reveals  that  there  is  currently 
no  universally  accepted  definition  or  measure  of  the  reliability  of  water  dis¬ 
tribution  systems.  Reliability  is  usually  defined  as  the  probability  that  a 
system  performs  its  mission  within  specified  limits  for  a  given  period  of  time 
in  a  specified  environment.  For  a  large  system,  with  many  interactive  subsys¬ 
tems  (such  as  a  water  distribution  system),  it  is  extremely  difficult  to  ana¬ 
lytically  compute  the  mathematical  reliability.  Accurate  calculation  of  a 
mathematical  reliability  requires  knowledge  of  the  precise  reliability  of  the 
basic  subsystems  or  components  and  the  impact  on  mission  accompl ishment  caused 
by  the  set  of  all  possible  subsystem  (component)  failures. 

9.  The  so-called  "optimal"  or  "minimum"  cost  design  of  water  distribu¬ 
tion  systems  has  been  approached  from  many  different  directions,  including  the 
use  of  several  types  of  optimization  techniques.  These  techniques  are 
reviewed  in  Part  II.  None  of  these  approaches  are  completely  satisfactory 
because  of  their  many  limitations.  Most  of  the  approaches  place  emphasis  on 
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designing  Che  water  distribution  system  to  function  under  normal  loading  con¬ 
ditions  using  peak  hourly  demands,  etc.  Very  little  work  has  been  done  on 
abnormal  or  emergency  loading  conditions  such  as  fire  demands,  pump  failure, 
control  valve  failure,  power  outages,  and  broken  links. 

10.  Many  researchers,  municipal  engineers,  urban  planners,  institutes, 
government  agencies,  etc.,  have  discussed  the  need  to  develop  explicit  mea¬ 
sures  and  methodologies  to  evaluate  water  distribution  system  reliability  and 
performance  under  emergency  loading  conditions.  At  present,  there  is  no 
accepted  definition  or  measure  of  the  reliability  of  water  distribution  sys¬ 
tems.  Some  researchers  have  proposed  candidate  approaches  using  concepts  of 
reliability  factors,  economic  loss  functions,  forced  redundancy  in  the 
designs,  etc.  All  of  these  approaches  have  limitations  in  problem  formulation 
and/or  solution  technique.  Some  investigators  discuss  the  need  to  explicitly 
Incorporate  measures  of  reliability  into  optimization  models  to  predict  system 
operation  under  emergency  loading  conditions.  At  present,  however,  no 
"optimization-reliability"  evaluation  or  design  technique  with  general  appli¬ 
cation  has  been  developed. 

11.  While  the  Corps  of  Engineers,  in  its  Civil  Works  mission,  does  not 
have  general  authority  to  construct  or  operate  water  distribution  systems,  the 
field  operating  agencies  are  often  called  upon  to  solve  water  distribution 
problems  as  part  of  Section  22  studies  or  specifically  authorized  studies.  In 
addition,  the  Corps  is  directly  responsible  for  planning,  design,  and  con¬ 
struction  of  water  distribution  systems  to  serve  military  installations  and 
Civil  Works  recreation  areas.  These  Corps  activities  in  the  implementation  of 
water  distribution  systems  require  the  evaluation  of  alternative  system  plans 
and  designs.  The  application  of  reliability  engineering  concepts  to  the  eval¬ 
uation  of  water  distribution  system  component  reliability  can  help  to  ensure 
that  reasonable  trade-offs  between  cost  and  reliability  will  be  achieved. 

Purpose  and  Scope 

12.  The  major  purpose  of  this  study  was  to  assemble  information  related 
to  the  applicability  of  reliability  concepts  to  the  evaluation  of  water  dis¬ 
tribution  systems.  This  included  a  detailed  review  of  water  resources  and 
reliability  literature  and  development  of  a  conceptual  basis  for  a  technique 
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or  methodology  to  apply  reliability  concepts  to  the  design  and  evaluation  of 
water  distribution  systems. 

13.  The  following  work  elements  have  been  performed  in  order  to  accomp¬ 
lish  the  purpose  described  above: 

a.  A  detailed  review  of  the  water  resources  literature  was  made  to 
determine  and  evaluate  the  work  related  to  steady-state  simula¬ 
tion  models,  optimization  approaches,  reliability  evaluation, 
and  pipe  and  pump  failure  and  replacement  analysis. 

b.  A  detailed  review  was  performed  of  the  reliability  literature 
including:  electronics  literature,  power  systems  literature, 
chemical  engineering  literature,  mechanical  engineering  litera¬ 
ture,  operations  research  literature,  and  various  other  pro¬ 
ceedings  and  textbooks. 

£.  The  availability  of  commercial  computer  codes  was  evaluated  for 
the  reliability  analysis  of  networks. 
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Organization  of  Report 


14.  This  report  is  organized  into  separate  parts  describing  the  results 
of  the  state-of-the-art  review  of  water  distribution  system  reliability 
analysis: 

a.  A  review  of  previous  work  in  water  distribution  system  analysis 
and  reliability  evaluation  is  presented  in  Part  II. 

b.  Methods  for  evaluating  the  reliability  of  individual  water  dis¬ 
tribution  system  components  are  presented  in  Part  III. 

c.  Techniques  for  evaluating  the  reliability  of  systems  are  pre¬ 
sented  in  Part  IV. 

d.  Examples  for  evaluation  of  pumping  station  and  pipe  reliability 
and  availability  are  presented  in  Part  V. 

e.  Conclusions  and  recommendations  are  presented  in  Part  VI. 

f_.  Additional  references  on  reliability  analysis  are  presented  in 
Appendix  A,  while  Appendix  B  lists  the  available  computer  codes 
for  reliability  analysis. 
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PART  II:  PREVIOUS  WORK  IN  WATER  DISTRIBUTION  SYSTEM  ANALYSIS 


15.  This  section  presents  a  summary  of  previous  work  on  the  analysis  of 
water  distribution  networks.  Sections  are  included  for  four  major  classifi¬ 
cations  of  analytical  techniques.  First,  models  that  have  been  developed  for 
the  steady-state  analysis  of  flow  in  water  distribution  networks  are  dis¬ 
cussed.  Second,  models  that  have  been  developed  for  the  optimal  (usually 
minimum  cost)  design  of  water  distribution  systems  are  presented.  Third,  a 
general  description  of  the  very  few  efforts  in  the  literature  dealing  with  the 
reliability  of  water  distribution  systems  is  presented.  Finally,  studies 
incorporating  both  reliability  and  maintenance  considerations  are  discussed. 


Network  Analysis  Approaches 


16.  Numerous  models  for  the  steady-state  analysis  of  flow  in  water  dis¬ 
tribution  networks  have  been  reported  in  the  literature.  A  brief  description 
of  the  physical  laws  is  given  first,  then  a  brief  outline  of  the  various  types 
of  network  solvers  is  presented.  The  equation  of  continuity  for  each  node  can 
be  expressed  as 

N 

£  Q  +  b  =  0  j  *  1...N  (1) 

i-1  31  3 


where 

N 


total  number  of  nodes  in  the  network 

flow  from  node  j  to  node  i 

external  flow  (consumption)  from  node  j 


The  loop  equations  describe  the  continuity  of  head  around  closed  paths  in  a 
network  as 


where 


k  =  k-th  link 

=  the  set  of  links  in  the  4-th  loop 

AH,  =  head  loss  for  the  k-th  link 
k 

L  =  the  total  number  of  loops  in  the  network 

The  path  equations  describe  the  continuity  of  head  or  energy  along  any 
continuous  path  through  a  network  as 


H 

e 

P 


T.  AH 

keKp 


p  =  1 . . .  P 


(3) 


where 

H  =  head  at  the  end  of  the  path 

P 

H,  =  head  at  the  beginning  of  the  path 
b 

P 

K  =  the  set  of  links  for  the  p-th  path 

P 

17.  Network  solvers  are  computer  programs  designed  to  solve  the  steady- 
state  flow  problem  using  the  node  equation,  the  loop  equations,  the  path  equa¬ 
tions,  or  a  combination  of  these.  Mechler  (1970),'  Shamir  (1973),  Jeppson 
(1976),  and  Stephenson  (1976  and  1984)  have  given  surveys  of  network  solvers. 
The  more  popular  network  solvers  include  the  Hardy  Cross  method,  Newton- 
Raphson  method,  and  graph  theory  methods.  Table  1  lists  various  selected 
water  distribution  network  solvers. 

Optimization  Approaches 

18.  The  optimal  design  of  water  distribution  systems  has  been 
approached  using  a  number  of  optimization  tools.  These  models  have  been 
developed  for  determining  pipe  diameters,  network  layout,  pump  capacities, 
heights  of  elevated  reservoirs,  valve  location,  and  other  design  parameters. 
The  objective  functions  are  to  minimize  costs  including  acquisition,  opera¬ 
tion,  and  maintenance  costs.  The  capabilities  of  the  various  models  include 
the  type  of  system  analyzed  (branched  and/or  looped)  ,  the  number  of  sources 
allowed  (single  or  multiple) ,  and  the  number  of  loading  (demand)  design 


conditions  handled.  The  solution  techniques  range  from  linear  programming  to 
nonlinear  programming. 

19.  A  general  statement  of  the  optimization  problem  (minimizing  cost) 
for  designing  looped  distribution  systems  subject  to  satisfying  steady-state 
conditions  and  minimum  head  levels  under  single  loading  conditions  can  be 
stated  as: 


t 
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Min 


NLINK 
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subject  to 


NPUMP 

NSTOR 
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'  EL1+  £  (XPk+XSm) 
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i  e  NODES 

(7) 

/  'j  V 
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HMIN1  S  Hi  ^  HMAX1 

i  e  NODES 
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gw 

in  which 


NLINK  =  the  number  of  potential  links  in  the  network 

P[L^,  D^]  *  cost  function  for  pipes  (link)  as  a  function  of  the 
length  of  link  k  and  the  diameter 

NPUMP  =  the  number  of  pumps  in  the  network 

PU[XP  ,  QP  ]  *  cost  function  for  pumps  as  a  function  of  the  head  lift 
XP^  for  pump  m  and  the  flow  capacity  QPffl 

NSTOR  ■  the  number  of  elevated  storage  reservoirs  in  the 
system 

PS[XSn>  XV  ]  *  cost  function  for  storage  as  a  function  of  the  storage 

height  XS  and  the  storage  volume  XV 
n  n 

■  the  flow  rate  in  link  k 

AH.  *  the  total  head  loss  over  link  k 
k 
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=  constant  which  is  a  function  of  the  links  and  the  unit 
of  measurement  of  the  empirical  head  loss  equation 

=  external  flow  (demand)  at  node  i 

EL^  =  elevation  of  node  i 

HMIN^  =  the  minimum  head  level  at  node  i 

HMAX^  =  the  maximum  head  level  at  node  i 

20.  The  objective  function  (Equation  4)  is  composed  of  cost  functions 
for  link  cost,  pump  and  energy  cost,  and  storage  costs  which  can  be  expressed 
in  terms  of  uniform  annual  costs.  The  linear  system  of  Equation  5  ensures 
that  conservation  of  flows  is  satisfied  at  each  node.  Equation  6  determines 
the  head  loss  and  flow  direction  in  each  link.  Equation  7  states  that  the 
total  head  at  each  source  node  is  the  sum  of  the  nodal  heads  plus  the  head 
added  by  pumps  and  storage  reservoirs  located  at  the  node.  Equation  8  ex¬ 
presses  the  maximum  and  minimum  head  bounds.  The  nonnegativity  constraints 
for  the  flow  rate,  pump  head  lift,  storage  elevation,  and  diameter  are  given 
as  Equation  9. 

21.  The  water  distribution  problem  (expressed  as  Equations  4-9)  is  an 
optimization  problem  with  a  nonlinear  objective  function  and  nonlinear  con¬ 
straints.  The  problem  is  even  more  difficult  if  constraints  are  added  to 
describe  how  the  system  operates  under  emergency  loading  conditions  such  as 
fire  demand  and  broken  mains.  A  considerable  amount  of  research  has  addressed 
the  water  distribution  system  optimization  problem.  Walski  (1985a)  identifies 
over  60  optimization  programs  with  more  being  published  almost  daily.  All  of 
these  efforts  have  attempted  to  solve  the  problem  through  simplification  in 
order  to  apply  various  types  of  optimization  approaches.  At  present  there  is 
no  method  for  optimal  (minimum  cost)  design  of  looped  distribution  systems 
that  is  completely  satisfactory.  An  added  difficulty  in  the  optimal  design  of 
a  looped  system  is  that  the  requirement  for  redundancy  is  extremely  vague. 
Existing  solution  methods  also  make  no  real  attempt  to  explicitly  generate  and 
evaluate  network  layouts  in  terms  of  their  ultimate  impact  on  total  systems 
cost  and  on  reliability  of  water  service. 

22.  Numerous  models  have  been  developed  and  reported  in  the  literature. 

Several  of  these  models  have  only  considered  branching  type  networks,  includ¬ 
ing:  linear  programming  approaches  by  Karmeli,  f.adish,  and  Myers  (1968);  and 

dynamic  programming  by  Lai  (1970)  and  Swamee,  Kumar,  and  Khanna  (1973). 

23.  Several  models  considering  looped  systems  have  also  been  developed. 
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One  of  the  first  models  was  developed  by  Shamir  (1973)  in  which  the  decision 
variable  was  pipe  diameter.  The  objective  function  considered  a  single  load¬ 
ing  and  was  related  to  energy  loss  in  flow  through  all  the  pipes.  The  steady- 
state  hydraulic  solution  was  obtained  by  the  Newton-Rhapson  method  with  the 
Jacobian  of  the  solution  used  to  compute  the  components  of  the  gradient. 

24.  Several  later  models  for  looped  systems  are  listed  and  compared  in 
Table  2.  This  table  is  not  complete,  but  does  list  the  more  important  models 
that  have  been  developed.  It  is  obvious  that  the  water  distribution  optimiza¬ 
tion  problem  has  not  been  completely  solved  and  a  great  deal  of  work  could  be 
performed.  The  past  models  have  not  considered  multiple  loading  for  various 
emergency  conditions.  These  models  have  made  simplifications  in  cost  equa¬ 
tions  and  various  constraints  to  limit  the  size  of  the  optimization  model. 
There  are  rather  severe  limitations  to  the  size  of  problems  that  can  be  solved 
with  these  existing  models.  In  addition,  these  models  do  not  attempt  to 
incorporate  any  reliability  considerations  nor  do  they  explicitly  consider 
various  network  layouts. 

Approaches  Incorporating  Reliability  Analysis 

25.  The  previous  section  briefly  described  the  existing  models  that 
have  been  developed  for  the  minimum  cost  design  of  water  distribution  systems. 
The  emphasis  of  these  models  has  been  on  designing  the  systems  to  operate 
under  normal  loading  conditions.  Very  little  effort  has  been  reported  In  the 
literature  on  models  that  consider  emergency  loading  conditions  caused  by  such 
events  as  fire  demand,  pump  failure,  or  broken  pipes. 

26.  One  of  the  first  efforts  at  considering  emergency  loading  condi¬ 
tions  was  by  de  Neufville,  Schaake,  and  Stafford  (1971),  which  examined  four 
primary  measures  of  water  distribution  system  design:  (a)  overall  perfor¬ 
mance;  (b)  fail-safe  reliability;  (c)  distribution  of  performance;  and 

(d)  cost.  The  most  significant  part  of  this  effort  was  to  quantitatively 
evaluate  water  distribution  system  performance  (node  load  values)  under  rea¬ 
listic  emergency  loading  conditions  and  to  consider  the  cost/benefit  trade¬ 
offs  of  the  emergency  loading  conditions.  This  effort,  however,  did  not 
explicitly  consider  any  type  of  reliability  evaluation. 

27.  Damelin,  Shamir,  and  Arad  (1972)  were  the  first  to  explicitly  con¬ 
sider  reliability  measures  in  the  design  of  water  distribution  systems.  One 
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of  the  major  premises  of  this  work  was  that  reliability  has  an  economic  value. 
The  simulation  (reliability)  model  developed  by  these  investigators  was  used 
for  evaluating  the  reliability  of  supplying  a  known  demand  pattern  in  a  given 
water  supply  in  which  shortfalls  could  be  caused  by  random  failures  of  the 
pumping  equipment.  The  economic  model  developed  enabled  use  of  the  reliabil¬ 
ity  model  to  compute  the  cost  of  the  marginal  water  obtained  by  improving  the 
reliability  of  tne  system. 

28.  The  first  objective  of  Damelin,  Shamir,  and  Arad  (1972)  was  to 
devise  a  means  of  evaluating  the  reliability  of  a  given  set  of  pumping  equip¬ 
ment  serving  a  given  demand  considering  the  random  nature  of  pumping  equipment 
failure  and  repair  duration.  This  tool  is  then  used  to  examine  the  effect  on 
reliability  of  variations  in  the  design  and  maintenance  procedures  for  various 
demand  patterns.  The  reliability  relationships  for  various  designs,  mainte¬ 
nance  procedures,  and  demand  patterns  are  used  as  a  model  to  evaluate: 

(a)  least  cost  points  on  isoreliability  surfaces  which  represent  the  selection 
of  the  pumping  to  achieve  a  required  level  of  reliability  at  a  minimum  cost; 

(b)  computation  of  marginal  water  cost  associated  with  variations  in  design  or 
maintenance  procedures  to  improve  system  reliability  for  a  given  demand  pat¬ 
tern;  and  (c)  computation  of  marginal  water  costs  for  both  augmenting  a  water 
supply  and  maintaining  the  same  level  of  system  reliability. 

29.  The  Damelin,  Shamir,  and  Arad  (1972)  model  considered  pump  inter¬ 
failure  times  and  repair  durations  as  random  variables.  The  interfailure  time 
was  assumed  to  be  exponentially  distributed  and  the  repair  duration  was 
assumed  to  be  lognormally  distributed. 

30.  Shamir  and  Howard  (1981)  used  the  approach  of  defining  the  relia¬ 
bility  as  the  probability  that  a  reliability  factor  is  achieved.  The  reli¬ 
ability  factor  for  single  events  or  for  selected  time  periods  was  defined  in 
terms  of  capacity  lost  due  to  failure,  measured  as  a  fraction  of  the  demand 
rate  and/or  the  demand  volume.  Because  the  lost  capacity  is  a  random  vari¬ 
able,  then  the  reliability  factor  is  also  a  random  variable. 

31.  This  study  (Shamir  and  Howard  1981)  considered  two  components 
(reliability  factors)  to  define  the  overall  reliability:  (a)  a  discharge 
reliability  factor  RC  defined  as 


C  -  discharge  lost  due  to  failure 
CT  -  total  discharge  rate  required 
n  -  exponent 


and  (b)  a  volume  reliability  factor  RV  ,  defined  as 


RV  -  1 


V_ 

VT 


C  x  D 
CT  x  D 


(ID 


where 

V  *»  shortfall  volume 

VT  »  total  volume  required  during  the  failure  event  time  period 

V  is  simply  C  x  D  where  D  is  the  duration  of  failure  and  VT  is  simply 
CT  x  D  .  Shamir  and  Howard  (1981)  then  expressed  the  reliability  factor  as 


RF 


RC  +  RV 
,  2 


32.  The  probability  density  function  pdf  of  RF  was  defined  as 


fRF<r> 


fc(c) 


2CTn 

CTn_1  +  nCn_1 


fc(c) 


(13) 


where  fc(c)  *  pdf  of  the  lost  capacity. 

33.  Shamir  and  Howard  (1981)  discussed  the  use  of  both  a  postulated  and 
an  exponential  probability  density  function  for  the  lost  capacity.  The  postu¬ 
lated  pdf  was  expressed  as 

fc(e)  ■  ir  (‘  -  if)”  <u> 

where  m  >  1  •  parameter  determined  from  the  data.  The  exponential  distri¬ 
bution  is  expressed  as 


34.  The  model  can  be  extended  to  determine  the  length  of  time  it  takes 

before  the  reliability  factor  drops  below  a  reference  value  of  the  reliability 

factor  RF  .  For  annual  values  the  return  period  is 
o 


1 

AP  (RF  <  RF  ) 
o 


»frf(rpo) 


(16) 


where  A  is  the  reciprocal  of  the  mean  time  between  failures  and  is  the  aver¬ 
age  number  of  failures  per  year.  The  above  equation  can  be  used  to  develop 
reliability  factor-frequency  curves  which  could  be  used  to  compare  the  reli¬ 
ability  of  alternative  systems. 

35.  The  probability  that  a  storage  volume  Sq  can  supply  the  demand 
during  a  failure  event  is 


P(C  x  D  <  S  ) 
o 


fc(d)  dc  fD(d)  dd 


(17) 


assuming  that  the  lost  capacity  during  failure  and  the  repair  duration  are 
statistically  independent.  Considering  lost  capacities  below  some  specified 
value  Cq  ,  the  corresponding  reliability  factor  is  RFq  .  The  reliability  is 
then  expressed  as  a  function  of  the  reliability  factor  (a  measure  of  lost 
capacity)  and  the  storage  as 

R(RFq,  Sq)  =  P^RF  <  RFq  |  S  =  =  P^C  x  D  <  Sq  |  C  <  (18) 


36.  The  reliability  can  also  be  expanded  to  consider  standby  pumping 
capacity  SBCq  as 


R(RF  ,  S  ,  SBC  ) 
o  o  o 


J f  fc(c)  dc  fD(d)  dd 
$ 


(19) 


The  region  of  integration  <J  is  illustrated  in  Figure  1  and  the  trade-off 
between  storage  and  standby  pumping  capacity  to  achieve  a  specified 


reliability  for  fixed  reliability  factors  is  illustrated  in  Figure  2. 

37.  For  each  reliability  curve  it  is  possible  to  compute  the  cost  of 
the  combinations  of  storage  and  standby  pumping  capacity  to  compute  the  least- 
cost  combinations  as  shown  in  Figure  2.  Relationships  can  then  be  plotted  for 
the  minimum  cost  versus  reliability  R  for  fixed  reliability  factors  RF  as 
shown  in  Figure  3,  and/or  the  minimum  cost  versus  RF  for  fixed  values  of  R 
as  shown  in  Figure  4. 

Approaches  Incorporating  Reliability  and  Maintenance 

38.  Several  articles  have  been  reported  in  the  literature  dealing  with 
criteria  and  methods  for  analyzing  the  replacement  and/or  repair  of  water  dis¬ 
tribution  pipes.  The  aging  of  pipes  causes  a  decrease  in  their  carrying  ca¬ 
pacity  and  they  become  more  prone  to  breakage.  The  reasons  for  breaks  in¬ 
clude:  quality  and  age  of  the  pipe;  quality  of  installation;  type  of  environ¬ 
ment,  e.g.  external  corrosion,  temperature  conditions,  frost  conditions,  soil 
movement,  external  loads,  etc;  and  service  conditions  such  as  operating  pres¬ 
sures  and  water  hammer. 

39.  The  cost  of  pipe  breaks  can  represent  a  significant  portion  of  the 
maintenance  costs  for  a  water  distribution  system.  As  a  result  the  decision 
of  whether  to  repair  or  replace  the  pipe  must  be  considered  from  an  economic 
viewpoint.  Stacha  (1978)  outlined  several  criteria  that  should  be  considered 
in  the  final  decision  to  replace  a  water  main.  These  criteria  include  com¬ 
parison  of  annual  costs  for  repair  and  replacement;  adequacy  of  the  capacity 
of  the  existing  main;  effect  on  water  quality  by  the  internal  condition  of  the 
pipe;  hazards  of  existing  main  to  the  safety  of  persons  and  property;  effect 
of  Increasing  demands  on  the  existing  pipes;  frequency  of  failure;  and  street 
conditions. 

40.  Shamir  and  Howard  (1979)  developed  a  procedure  to  schedule  pipe 
replacement  based  upon  forecasted  number  of  breaks  of  existing  pipe;  fore¬ 
casted  number  of  breaks  in  the  new  pipe;  cost  of  repairing  one  break;  cost  of 
replacing  the  existing  pipe;  and  the  discount  rate.  Regression  equations  of 
the  form 


A(t-t  ) 


where 


N(t)  =  number  of  breaks  per  1,000-ft  length  of  pipe  in  year  t 
t  =  time  in  years 

t  ■  base  year  when  pipe  was  installed 
A  =  growth  rate  coefficient 

41.  The  cost  of  repairing  the  breaks  in  1,000  ft  of  pipe  in  year  t 
was  expressed  as 


A(t-t  ) 

C  (t)  =  C.N(t)  =  C,N(t  ) e 
m  b  bo 


(21: 


where  C,  is  the  cost  of  repairing  a  break.  The  present  value  of  all  mainte- 
b 

nance  cost  P  (t  )  from  the  year  t  to  year  t  for  1,000  ft  of  pipe  is 
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(22: 


where  R  is  the  discount  rate. 

42.  The  present  value  in  year  t  of  replacing  1,000  ft  of  pipe  in 
year  tf  is 
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where  is  the  cost  of  replacement.  The  optimal  timing  for  replacement  is 
the  time  t  that  minimizes  the  total  present  cost  which  is  stated  mathemati 
cally  as: 
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Differentiating  with  respect  to  t  and  solving  for  tf  gives 


t  = 
r 


t  +x  In 
o  A 


In  (1  +  R)C 
N<to)Cb 


(25) 


For  a  linear  increase  in  breaks  given  by, 


N(t)  -  N(t  )  +  A(t  -t  ) 
o  p  o 

the  optimal  time  for  replacement  is  expressed  as 

In  (1  +  R)Cr 
Cr  “  Co  +  AN (t  )C. 

O  D 


(26) 


(27) 


43.  Walski  and  Pelliccia  (1982)  stated  that  the  exponential  Equation  20 
fits  certain  data  better  than  the  linear  Equation  26.  A  break  prediction 
model  was  used  of  the  form. 


N(t) 


CjC2ae 


b(t-k) 


(28) 


where 

N(t)  =  break  rate  per  year  per  mile 

0^  =  correction  factor  for  previous  breaks 

C2  “  correction  factor  for  pipe  size 

a  and  b  *  regression  coefficients 

k  *  year  of  pipe  installation  so  that  (t  -  k)  is  the  age  of  the 
pipe. 


44.  Walski  and  Pelliccia  (1982)  developed  a  new  criterion  to  replace 
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pipes  which  stated  that  if  a  pipes  current  break  rate  J  is  greater  than  some 
critical  break  rate  J*  ,  the  pipe  should  be  replaced.  The  criterion  was 
stated  as 


J  >  J*  = 


5280  CrL  ln  Vr+RV 


1  +  R 


-  1 


(29) 


where 

J  and  J*  =  breaks  per  year  per  mile 

=  replacement  cost 

L  =  fraction  of  pipe  replaced 

b  =  regression  coefficient 

R  =  interest  rate 

C.  =  cost  of  a  break 
b 

m  =  period  of  analysis,  years 

Walski  and  Pelliccia  (1982)  concluded  that  the  Shamir  and  Howard  (1979) 
approach  (Equation  24)  is  useful  for  deciding  whether  to  replace  entire  groups 
of  pipes  but  Equation  29  is  more  useful  in  analyzing  the  economic  replacement 
on  a  pipe-by-pipe  basis. 

45.  Walski  and  Pelliccia  (1982)  also  presented  two  simplifications  to 
Equation  29.  The  first  is 


J*  = 


5280  C  L  ln 
r 


(K 


(30) 


iL 


R,v 

'•  .‘‘A 


fe-T 


in  which  b  is  significant,  but  the  break  rate  of  the  future  pipe  need  not  be 
considered  such  as  for  most  unprotected  pipes  in  fairly  corrosive  soils.  The 
second  is 


J* 


5280  CM.  ln  (1  +  R) 


(31) 


t— 


in  which  b  =  0  ,  i.e.  the  break  rate  is  not  changing  significantly  with  time. 
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which  is  the  case  for  most  nonmetallic  pipes  or  metallic  pipes  in  noncorrosive 
soil. 

46.  Clark,  Stafford,  and  Goodrich  (1982)  used  the  analysis  by  Shamir 
and  Howard  (1979)  to  define  the  number  of  maintenance  events  in  a  given  sec¬ 
tion  of  pipe  as 

GR(t-t  ) 

N(t)  =  Ke  0  (32) 

where 

N(t)  =  number  of  maintenance  events 
K  *  a  constant 

GR  -  growth  rate  coefficient 
t  =  number  of  years  from  installation 

t  =  number  of  years  from  installation  to  the  first  maintenance  event 

The  optimal  time  to  replacement  was  the  same  as  the  Shamir  and  Howard  (1979) 
approach.  This  effort  also  developed  regression  equations  for  the  number  of 
years  from  installation  to  first  repair  and  for  the  number  of  repairs. 

47.  Walski  (1982,  1985b)  considered  the  economic  analysis  of  the  reha¬ 
bilitation  of  water  mains.  This  analysis  considered  that  the  cleaning  and 
relining  of  water  transmission  lines  is  economical  if  the  costs  are  less  than 
the  savings  in  energy  and  pumping  capacity  which  occur  because  of  the  in¬ 
creased  carrying  capacity  of  the  pipe.  The  criteria  developed  can  be  used  to 
determine  if  it  is  economical  to  clean  and  relign  a  pipe  for  two  cases:  flow 
is  not  significantly  changed  by  rehabilitation  of  the  pipe,  or  the  system  is 
looped  so  that  the  change  in  carrying  capacity  significantly  changes  flow. 

48.  The  decision  criterion  was  to  rehabilitate  if  the  cost  for  rehabil¬ 
itation  is  less  than  the  extra  cost  for  pumping  energy  and  additional  equip¬ 
ment  required  to  force  water  through  the  main  with  a  low  Hazen-Williams 
C-factor,  stated  as 

c  <  (c  -  c  )  +  (c  -  c  ) s pwf  (33) 

r  p  p  e  e 

where 

c^  **  cost  of  rehabilitation 

c  *  cost  of  pumping  equipment  required  with  no  rehabilitation 


Cp  =  cost  of  pumping  equipment  with  rehabilitation 

=  cost  of  energy  loss  in  pipe  without  rehabilitation 
c^  =  cost  of  energy  loss  in  pipe  with  rehabilitation 
spwf  =  series  present  worth  factor  for  energy  cost 
A  rehabilitation  index  R  was  defined  as 


R=(c  -  c  )  +  (c  -c)  spwf  -  c 

P  p  g  0  ‘  J* 


The  cost  of  energy  loss  cg  -  cg  was  derived  as 


-  °-lm  of’85 ./  1  1 

'  d‘-87E  Ic1-85  'c1-85 


where 

Q  =  average  flow,  gal/min 
D  =  diameter,  in. 

E  =  wire-to-water  efficiency 
P  =  price  of  energy,  c/kwhr 
C  =  Hazen-Williams  coefficient 

C  =  Hazen-Williams  coefficient  after  rehabilitation 
The  cost  savings  (cost  of  pumping)  by  rehabilitating  the  pipes  was  expressed 


‘•V-'.v*- 


where 


F  =  (pwf)(dc/dh)  =  rate  of  change  in  cost  per  change  in  head  loss 

pwf  =  present  worth  factor  based  on  the  year  that  additional  pump 
capacity  is  required 

dc/dh  =  rate  of  change  in  cost  per  change  in  head  loss 
Ahp  =  increase  in  head  provided  by  pumps 


c  =  FAh  = 
P  P 


10.43  Q 


49.  Equations  35  and  36  can  be  substituted  Into  Equation  34  to  express 


R  as 


10.43  C* 
-4.87 


.  Q^‘85P(spwf)  (0.0164) 

Q„  F  +  — - p - 

P  E 


-  c 


In  which 


C* 


(37) 


(38) 


The  above  models  were  used  to  develop  graphs  of  the  various  parameters  versus 
degree  of  rehabilitation  as  shown  in  Figure  5.  The  approach  can  be  used  to 
determine  if  it  is  economical  to  rehabilitate  pipes  based  on  the  savings  in 
energy  and  pumping  capacity.  Walski  (1982)  emphasizes  that  this  approach  is 
appropriate  for  lines  fed  directly  from  pumps  but  not  small  distribution  lines 
sized  for  fire  flow  and  being  fed  from  storage  tanks. 
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PART  III:  COMPONENT  RELIABILITY  ANALYSIS 


Definitions  of  Terms 


50.  The  analysis  of  reliability  and  availability  requires  an  under¬ 
standing  of  some  basic  terms,  which  are  defined  in  this  section.  The  concepts 
represented  by  these  terms  will  be  used  in  later  sections  to  quantify  relia¬ 
bility  and  availability. 

Reliability 

51.  The  reliability  R(t)  of  a  component  is  defined  as  the  probability 
that  the  component  experiences  no  failures  during  the  time  interval  from  time 
zero  to  time  t(t,o)  ,  given  that  it  is  new  or  repaired  at  time  zero.  Relia¬ 
bility  can  be  defined  mathematically  as: 


00 

R(t)  =  /  f(t)  dt  (39) 

t 

where  f(t)  **  probability  density  function  of  the  time  to  failure.  Values  for 
R(t)  range  between  0  and  1.  The  probability  density  function  f(t)  may  be 
developed  from  equipment  failure  data,  using  various  statistical  methods;  how¬ 
ever,  in  most  cases  an  exponential  distribution  is  assumed. 

Unreliability 

52.  The  unreliability  F(t)  of  a  component  is  defined  as  the  probabil¬ 
ity  that  the  component  will  fail  by  time  t  .  Unreliability  can  be  defined 
mathematically  as: 


t 

F(t)  -  f  f(t)  dt  (40) 

J0 


Failure  rate 

53.  The  failure  rate  r(t)  is  the  probability  that  a  component  experi¬ 
ences  a  failure  per  unit  of  time  t  given  that  the  component  was  operating  at 
time  zero  and  has  survived  to  time  t  .  The  quantity  r(t)  dt  is  the  proba¬ 
bility  that  a  component  fails  during  time  (t,t+dt)  .  Values  for  r(t)  dt 
range  from  0  to  1 . 


.  -•  _■  -■  _*  V 
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Mean  time  to  failure 


54.  The  mean  time  to  failure  MTTF  is  the  expected  value  of  the  time 
to  failure,  stated  mathematically  as: 

00 

MTTF  -  f  t  x  f(t)  dt  (41) 

o 

The  MTTF  is  usually  expiessed  in  hours. 

Probability  of  repair 

55.  The  probability  of  repair  G(t)  is  the  probability  that  the  com¬ 
ponent  repair  is  completed  before  time  t  ,  given  that  the  component  failed  at 
time  zero.  Note  that  the  repair  process  starts  with  a  failure  at  time  zero 
and  ends  at  the  completion  of  the  repair  at  time  t  . 

Repair  rate 

56.  The  repair  rate  g(t)  is  the  probability  that  the  component  is 
repaired  per  unit  time  at  time  t  given  that  the  component  failed  at  time 
zero  and  is  still  not  repaired  at  time  t  .  The  quantity  g(t)  dt  is  the 
probability  that  a  component  is  repaired  during  time  (t,t+dt)  given  that  the 
components  failure  occurred  at  time  t  . 

Mean  time  to  repair 

57.  The  mean  time  to  repair  MTTR  is  the  expected  value  of  the  time  to 
repair  a  failed  component.  MTTF  is  defined  mathematically  as: 

00 

MTTR  =  f  t  *  g(t)  dt  (42) 

o 

where  g(t)  is  the  probability  density  function  for  the  repair  time.  The 
MTTR  is  usually  expressed  in  hours. 

Mean  time  between  failures 

58.  The  mean  time  between  failures  MTBF  is  the  expected  value  of  the 
time  between  two  consecutive  failures.  For  a  repairable  component,  the  MTBF 
is  defined  mathematically  as: 


MTBF  =  MTTF  +  MTTR 


(43) 


59.  The  mean  time  between  repairs  MTBR  is  the  expected  value  of  the 
time  between  two  consecutive  repairs  and  equals  the  MTBF  . 

Availability 

60.  The  availability  A(t)  of  a  component  is  the  probability  that  the 

component  is  in  operating  condition  at  time  t  ,  given  that  the  component  was 
as  good  as  new  at  time  zero.  The  reliability  generally  differs  from  the 
availability  because  reliability  requires  the  continuation  of  the  operational 
state  over  the  whole  interval  (0,t)  .  Subcomponents  contribute  to  the  avail 
ability  A(t)  but  not  to  the  reliability  R(t)  if  the  subcomponent  that 
failed  before  time  t  is  repaired  and  is  then  operational  at  time  t  .  As  a 

result,  the  availability  A(t)  is  always  larger  than  or  equal  to  the  relia¬ 
bility  R(t)  ,  i.e.  A(t)  >  R(t)  .  For  a  nonrepairable  component,  it  is  oper 

atlonal  at  time  t  if  and  only  if  it  has  been  operational  to  time  t  ,  i.e. 

A(t)  “  R(t)  for  nonrepairable  components.  As  shown  in  Figure  6,  the  avail¬ 
ability  of  a  nonrepairable  component  decreases  to  zero  as  t  becomes  larger, 
whereas  the  availability  of  a  repairable  component  converges  to  a  nonzero 
positive  number. 


Figure  6.  Availability  for  repairable  and 
nonrepairable  components 


Unavailability 

61.  The  unavailability  Q(t)  at  time  t  is  the  probability  that  a 
component  is  in  the  failed  state  at  time  t  ,  given  that  it  started  in  the 
operational  state  at  time  zero.  The  Q(t)  is  less  than  or  equal  to  the 
unreliability  F(t)  and  for  nonrepairable  components  they  are  equal.  Because 
a  component  is  either  in  the  normal  state  or  failed  state  at  time  t  ,  then 

A(t)  +  Q(t)  =  1  (44) 


Conditional  failure  intensity 

62.  Conditional  failure  intensity  A(t)  is  the  probability  that  a  com¬ 
ponent  fails  per  unit  time  at  time  t  ,  given  that  it  is  in  the  operational 
state  at  time  zero  and  is  operational  at  time  t  .  The  quantity  ^(t)  dt  is 
the  probability  that  a  component  fails  during  a  small  time  interval  (t,t+dt) 
given  that  the  component  was  as  good  as  new  at  time  zero  and  operational  at 
time  t  .  The  quan  ..ty  r(t)  dt  is  the  probability  that  a  component  fails 
during  the  time  interval  given  that  the  component  was  repaired  at  time  zero 
and  has  been  operational  to  time  t  .  The  quantities  l(t)  dt  and  r(t)  dt 
differ  because  r(t)  dt  assumes  the  continuation  of  the  operational  state  to 
time  t  or  that  no  failure  occurred  in  the  interval  (0,t)  ,  whereas  Mt)  dt 
only  assumes  that  the  component  is  operational  at  time  t  ,  i.e.  intermediate 
failures  between  time  zero  and  time  t  are  not  important  to  the  calculation. 


Mt)  =  r(t) 

general  case 

(43) 

Mt)  =  r  (t) 

nonrepairable  component 

(46) 

X(t)  =  r 

constant  failure  rate  r 

(47) 

Unconditional  failure  intensity 

63.  The  unconditional  failure  intensity  w(t)  is  the  probability  that 
a  component  fails  per  unit  time  at  time  t  ,  given  that  it  started  in  the 
operational  state  at  time  zero.  The  unconditional  failure  intensity  is 
obtained  from  the  analysis  of  equipment  failure  data  (Henley  and  Kumamoto 

1981). 

Expected  number  of  failures 

64.  The  expected  number  of  failures  W(t,t+dt)  ,  given  that  the  compo¬ 
nent  started  into  the  operational  state  at  time  zero,  is  defined  as 


For  a  nonrepairable  component,  W(0,t)  =  F(t)  and  approaches  unity  as  t 
gets  larger.  For  a  repairable  component,  W(0,t)  diverges  to  infinity  as  t 
becomes  larger.  Typical  curves  of  W(0,t)  are  shown  in  Figure  7. 
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Figure  7.  Expected  number  of  failures  for  repairable  and 
nonrepairable  components 


Conditional  repair  Intensity 

65.  The  conditional  repair  intensity  u(t)  is  the  probability  that  a 
component  is  repaired  per  unit  time  at  time  t  ,  given  that  it  started  into 
the  normal  state  at  time  zero  and  failed  at  time  t  .  The  repair  rate  m(t) 
and  u(t)  differ  in  a  manner  similar  to  the  relation  between  A(t)  and 
r (t)  . 


u(t)  -  m(t)  *  0  nonrepairable  component 
u(t)  *  m  constant  repair  rate  m 


(AS 
( 5  C 


> 

I 

Unconditional  repair  intensity 

66.  An  unconditional  repair  intensity  v(t)  is 

|  component  is  repaired  per  unit  time  at  time  t  ,  given 

the  operational  state  at  time  zero. 

Expected  number  of  repairs 

67.  Expected  number  of  repairs  during  (t,t+dt) 

|  ponent  started  into  the  operational  state  at  time  zero 

t+dt 

V(t,t+dt)  =  f  v(t)  dt 

For  a  nonrepairable  component  V(0,t)  =  0  and  for  a  repairable  component, 
V(0,t)  +  °“  as  t  gets  larger. 

Density  functions 

68.  The  common  thread  in  the  analysis  of  reliability  and  availability  is 
the  selection  of  an  appropriate  failure  density  function.  A  probability  den¬ 
sity  function  or  simply  a  density  function  is  a  function  that  associates  with 
each  value  of  a  random  variable  the  probability  that  this  value  will  be 
assumed.  It  is  a  theoretical  model  for  the  frequency  distribution  of  a  popu¬ 
lation  of  measurements  (Mendenhall,  Scheaffer,  and  Wackerly  1981).  Density 
functions  are  used  to  model  a  variety  of  reliability-associated  events  includ¬ 
ing  time  to  failure  and  time  to  repair.  Some  of  the  more  common  failure  dens¬ 
ity  functions  used  in  reliability  analysis  and  their  associated  unreliability, 
failure  rate,  and  mean  time  to  failure  functions  are  presented  in  Table  3. 

69.  Because  of  its  relative  simplicity  for  performing  reliability  anal¬ 
yses,  the  exponential  distribution  is  probably  the  most  widely  used  failure 
density  function.  The  exponential  density  function  is: 

f(t)  =  Xe_Xt  t  >  0  ,  X  >  0  (52) 

where  X  is  a  constant  failure  rate.  The  reliability  is  simply 

R(t)  =  e_At  (53) 


the  probability  that  a 
that  it  started  into 

given  that  the  com- 
is 

(51) 


The  MTTF  is 


(54) 


MTTF  -  f  t  x  \e~Xt  dt  -  ~ 

0  X 


The  mean  time  to  repair  can  also  be  defined  using  the  exponential  density 
function 


t(t)  -  ue  ut 


(55) 


so  that 


MTTR  -  f  t  x  ue"Ut  dt 


_L_ 

u 


(56) 


The  MTTR  can  be  estimated  using  an  arithmetical  mean  of  the  time  to  repair 
data  for  various  types  of  components. 


Parameter  Relationships 

70.  Henley  and  Kumamoto  (1981)  developed  a  number  of  relationships 
between  the  various  reliability  (availability)  parameters.  The  more  signifi¬ 
cant  of  these  are  summarized  in  Table  4. 

Availability  and  unavailability 
for  constant  failure  and  repair  rates 

71.  For  a  constant  failure  rate  and  a  constant  repair  rate  the  analysis 
of  the  whole  process  can  be  simplified  to  analytical  solutions.  Henley  and 
Kumamoto  (1981)  used  Laplace  transforms  to  derive  the  unavailability  as 

«‘>  ‘  TTT  <”> 

and  the  availability 


A(t) 


i  -  Q(t)  -  rrr  + 


X  + 


-(A+u) t 


(58) 
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The  stationary  unavailability  Q(®)  and  the  stationary  availability  A(“) 
are 


u 


MTTR 

MTTF  +  MTTR 


(59) 


A(°°) 


u  _  MTTF 
X  +  u  ~  MTTF  +  MTTR 


(60) 


The  following  relation  is  also  true 


Q(t)  .  -(Hu)t 
Q(“) 


(61) 


A  summary  of  the  other  relations  for  the  whole  process  for  constant  failure 
rates  is  given  in  Table  4. 

Relations  for  repair-to-failure 
and  failure-to-repair  processes 

72.  The  probability  that  a  component  failure  occurs  during  time 
interval  (t,t+dt)  ,  represented  by  A  ,  given  that  the  component  has  been 
normal  to  time  t  (represented  by  C  )  and  the  component  was  repaired  at  time 
zero  (represented  by  W  )  is  expressed  as 


P [ A | C , W] 


P  [  A ,  C  |  W  ] 
P[C|WJ 


(62) 


where 

P [A,C | W ]  =  f(t)  dt 

P[C|W]  =  the  reliability  R(t) 

Then  P [ A | C , W ]  =  r(t)  dt  so  that 


r ( t)  dt 


f(t)  dt 
R(t) 


(63) 


or 


r  ( t ) 


f(t)  _  f(t) 
R(t)  1  -  F(t) 


(64) 
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The  reliability  is  then 


“  -l 

t 

R(t)  -  exp  -  f  r(t)  dt 
^0 


and  the  unreliability  is  simply  F(t)  =  1  -  R(t)  .  The  density  function  is 


"  t 

f(t)  =  r (t)  exp  -  f  r (t)  dt 

Jo 


73.  Similar  to  the  repair-to-failure  process,  relations  can  be  derived 
for  the  failure-to-repair  process: 


1  -  G(t) 


G(t)  =  1  -  exp  -  f  m(t)  du 

J0 


g(t)  -  m(t)  exp 


t 

/  m(u)  du 

Jo 


Analysis  of  the  whole 
rocess  using  time  to  failure  data 


74.  When  time  to  failure  data  are  known,  commonly  used  distributions 
can  be  employed  in  the  analysis.  For  components  which  have  an  increasing 
failure  rate  with  time,  the  normal,  lognormal,  or  the  Weibull  distribution 
with  a  shape  parameter  larger  than  unity  could  apply. 

75.  The  following  analysis  will  assume  that  the  Weibull  distribution 
applies  to  the  increasing  failure  rate  for  water  mains.  Properties  of  this 
distribution  are  given  in  Table  3.  The  unreliability  is  expressed  as 


i 


V. 


v 

i 


where 

Y  =  time  when  the  component  begins  to  fail 
9  **  characteristic  life 

8  =  shape  parameter  of  the  Weibull  slopes 

For  practical  reasons,  it  is  convenient  to  assume  that  y  =  0  so  that  the 

above  is  reduced  to 


F(t)  =  1  -  exp 


[-«)'] 


(71) 


which  reduces  to 


log  / log 


8  log  t  -  B  log 


(72) 


so  that  | log  1 / [ 1  -  F(t)]|  plots  as  a  straight  line  against  log  t  with 
slope  g  and  y-intercept  of  -g  log  0  .  To  use  the  Weibull  distribution,  it 
is  necessary  to  estimate  the  two  parameters  6  and  g  using  time  to  failure 
data.  The  data  are  plotted  on  Weibull  paper  and  a  straight  line  is  fitted  to 
the  data. 

76.  The  mean  time  to  failure  is 


MTTF 


and  the  failure  rate  is 


-/ 


t  *  f(t)  dt  =  y  +  6  *  r 


1  +  6 


(73) 


r(t)  -  -  r> 

e8 


6-1 


(74) 


The  mean  of  the  Weibull  distribution  is 


Mean  =  er  j 1  +  - 

\  6 


(75) 


and  the  variance  is 
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77.  The  unconditional  intensities  w(t)  and  v(t)  can  be  calculated 
using  the  following  relationships  (Henley  and  Kumamoto  1981).  For  the  uncon¬ 
ditional  failure  intensity, 


t 

w(t)  =  f(t)  +  f  f(t  -  u)v(u)  du  (7 

J0 


and  for  the  unconditional  repair  intensity 


v(t) 


t 

f  g(t  -  u)w(u) 
0 


du 


(78 


These  intensities  can  be  determined  using  either  iterative  numerical  integra¬ 
tion  or  Laplace  transform  techniques  when  f(t)  and  g(t)  are  known.  The 
unavailability  for  the  whole  process  can  be  expressed  as 

Q(t)  *  W(0, t)  -  V(0,t)  ( 


t 

Q(t)  =  f  [w(u)  -  v(u)]  du 
0 


The  conditional  failure  intensity  can  be  expressed  as 


A(t) 


W(t) 

1  -  Q(t) 


(80 


(81 


and  the  conditional  repair  intensity  is  expressed  as 


Data  Requirements  and  Availability 


78.  Data  requirements  for  application  of  the  various  failure  density 
functions  usually  include  a  determination  of  the  MTTF  ,  MTTR  ,  and  any 
associated  preventative  maintenance  requirements.  Unfortunately,  detailed 
data  of  this  nature  usually  are  not  readily  available.  Most  of  the  data  col¬ 
lected  on  the  reliability  of  mechanical  equipment  have  been  for  aircraft  or 
nuclear  power  systems,  which  have  substantially  more  stringent  design,  manu¬ 
facturing,  and  maintenance  standards  than  similar  equipment  installed  by  water 
utilities.  The  most  comprehensive  set  of  reliability  data  for  equipment  of 
similar  design  and  service  to  that  found  in  water  utilities  appears  to  have 
been  developed  by  Shultz  and  Parr  (1981)  for  EPA.  Among  other  items  of  equip¬ 
ment,  Shultz  and  Parr  collected  reliability  data  on  pumps,  motors,  controls, 
power  transmission,  and  valves.  Reliability  and  maintainability  data  as  col¬ 
lected  by  Shultz  and  Parr  (1981)  are  presented  in  Tables  5-7.  Although  the 
reported  MTBF  values  appear  high  and  corresponding  MTTR  values  appear  low, 
Shultz  reported  that  many  units  operated  for  up  to  10  years  without  failure. 

In  addition,  the  time  to  repair  value  is  based  upon  the  active  man-hours 
required  to  disassemble,  correct,  and  reassemble  the  unit.  Time  expended 
waiting  for  materials  and  manpower  scheduling  was  not  included.  Obviously, 
these  items  could  have  a  significant  impact  on  the  MTTR  value. 

79.  The  data  collected  by  Shultz  and  Parr  (1981)  were  compared  with 
reliability  data  on  similar  subsystems  available  from  two  other  sources: 
Reliability  Analysis  Center  (1981)  and  Southwest  Research  Institute  (1978). 

The  results  of  this  comparison  are  presented  in  Table  8.  For  most  subsystems, 
the  other  data  bases  contained  substantially  more  operating  hours  than  did  the 
Shultz  and  Parr  survey.  For  all  subsystems,  the  Shultz  and  Parr  data  reflect 
the  lowest  MTBF  .  This  result  is  expected  since  the  subsystems  reported  in 
the  nuclear  plant  reliability  data  system  (Southwest  Research  Institute  1978) 
are  all  in  the  safety  class  of  the  nuclear  steam  supply  system.  In  addition, 
many  of  the  components  from  the  nonelectric  parts  reliability  data  (Reliabil¬ 
ity  Analysis  Center  1981)  were  qualified  to  military  specifications  and  were 
used  in  military  applications.  The  added  safety  and/or  reliability  require¬ 
ments  for  components  in  these  types  of  application  probably  contributed  to  the 
higher  MTBF  values. 


Stress-Strength  Analysis 


80.  Strength  of  a  component  is  defined  as  the  ability  of  the  component 
to  accomplish  its  required  mission  satisfactorily  without  a  failure  when  sub¬ 
jected  to  an  external  stress.  Stress  is  the  loading  of  the  component,  which 
may  be  mechanical  load,  environment,  flow  rate,  temperature,  etc.  The  stress 
(load)  tends  to  produce  failure  of  the  component.  When  the  strength  of  the 
component  is  less  than  the  stress  imposed  on  it,  the  failure  occurs.  This 
type  of  analysis  can  be  applied  to  the  reliability  analysis  of  components  of 
water  distribution  systems. 

Static  reliability  models 

81.  The  reliability  of  a  hydraulic  system  is  defined  as  the  probability 
of  the  strength  or  resistance  y  exceeding  the  stress  or  loading  x  ,  i.e., 
the  probability  of  survival.  The  terms  "stress"  and  "strength"  are  more  mean¬ 
ingful  to  structural  engineers,  whereas  the  terms  "loading"  and  "resistance" 
are  more  descriptive  to  water  resources  engineers.  The  risk  of  a  hydraulic 
component,  subsystem,  or  system  is  defined  as  the  probability  of  the  loading 
exceeding  the  resistance,  i.e.,  the  probability  of  failure.  The  mathematical 
representation  of  the  reliability  R  and  the  risk  R  can  be  expressed  as, 
respectively. 


R  *  P^(y  >  x)  *  P^(y  -  x  >  0) 


(83) 


and 


R  -  Pr(y  <  x)  =  Pf(y  -  x  <  0)  (84) 

in  which  P^(  )  refers  to  probability.  The  relationship  between  reliability 
and  risk  is 


R  -  1  -  R  (85) 

82.  The  resistance  of  a  hydraulic  structure  is  essentially  the  capacity 
of  the  structure  and  the  loading  is  essentially  the  magnitude  of  flows  through 
or  over  the  structure,  e.g.,  from  a  hydrologic  event.  Since  the  loading  and 
resistance  are  random  variables  due  to  the  various  hydraulic  and  hydrologic 
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uncertainties,  a  knowledge  of  the  probability  distributions  of  y  and  x  is 
required  to  develop  risk  and  reliability  models.  The  computation  of  risk  and 
reliability  can  be  referred  to  as  "loading  resistance  interference"  as  shown 
in  Figure  8.  The  intersection  of  the  load  resistance  curves  demonstrates  the 
interaction  of  two  composite  stochastic  processes.  The  reliability  is  the 
probability  that  the  resistance  is  greater  than  the  loading  for  all  possible 
values  of  the  loading. 


*,y 


Figure  8.  Typical  load-resistance  interference  diagram 

83.  The  word  "static,"  from  the  reliability  computation  point  of  view, 
represents  the  worst  single  stress,  or  load,  applied.  The  loading  applied  to 
many  hydraulic  structures  is  random  variable,  and,  in  addition,  the  number  of 
times  of  loading  is  unknown  and  random.  If  the  distribution  of  the  loading 
and  resistance  is  known,  the  reliability  computation  is  referred  to  as  a  load- 
resistance  interference  diagram  (Figure  8).  Following  the  definitions  given 
in  Equations  83  and  84,  the  reliability  and  risk  of  a  hydraulic  structure  can 
be  expressed  as,  respectively 


and 


R  «  1  -< 

f 

1  00 

/  £y<y) 

/  fx(x)  dx 

dy; 

1 

1^  y 

L y  J 

) 

in  which  f  (  )  and  f  (  )  represent  the  probability  density  functions  of 

y  x 

resistance  and  loading,  respectively.  The  reliability  computations  for  a 
hydraulic  structure  for  the  static  case  require  the  probability  distributions 
of  loading  and  resistance.  The  safety  factor  SF  is  defined  as 
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in  which  the  bar  denotes  the  mean.  This  type  of  SF  has  been  referred  to  as 
a  characteristic  SF  by  Yen  (1978). 

84.  In  cases  in  which  both  the  stress  x  and  the  strength  y  are 
exponentially  distributed,  i.e. 


f  (x)  *=  X  e 
x  x 


-A  y 

f  (x)  -  A  e  y 

y  y 


then  the  static  reliability  reduces  to  (Kapur  and  Lamberson  1977) 


A  +  A 

y  * 


85.  In  cases  in  which  both  the  stress  x  and  strength  y  are  log- 
normally  distributed,  the  reliability  is  computed  as  (Kapur  and  Lamberson 
1977) 


'-.■'-.--.•I 

m  i 


(92) 


+z 

R  =  J"  $(z)  dz 


where  <J> ( z )  is  the  probability  density  function  for  the  standard  normal 
deviate  z  given  as 

In  -  In 


86.  In  cases  in  which  the  stress  x  is  exponentially  distributed  and 
the  strength  is  normally  distributed,  the  reliability  can  be  expressed  as 
(Kapur  and  Lamberson  1977) 


R  =  1  -  -  exp  -  %  2p  X  -  X^o^ 

l  a  I  H  l  y  x  x  y 


p  -  X  a  2 


Dynamic  reliability  models 

87.  Dynamic  or  time-dependent  reliability  models  consider  repeated 
application  of  loading  and  also  can  consider  the  change  of  the  distribution  of 
resistance  with  time.  The  practical  motivation  behind  considering  time- 
dependent  risk  and  reliability  models  is  that,  for  hydraulic  structures,  there 
is  uncertainty  about  the  loading  and  resistance  random  variables  with  respect 
to  time  and  loading  cycles. 

88.  Repeated  loadings  on  a  hydraulic  structure  are  characterized  by  the 
time  each  load  is  applied  and  the  behavior  of  time  intervals  between  the 
application  of  loads.  From  a  reliability  theory  viewpoint,  the  uncertainty 
about  the  loading  and  resistance  variables  may  be  classified  into  three  cate¬ 
gories:  deterministic,  random  fixed,  and  random  independent  (Kapur  and 
Lamberson  1977).  For  the  deterministic  category,  the  variable  assumes  values 
that  are  exactly  known  a  priori.  For  the  random  fixed  case,  the  randomness 
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varies  in  time  in  a  known  manner.  For  the  random  independent  case,  the 
variable  is  not  only  random  but  the  successive  values  assumed  by  the  variable 
are  statistically  independent. 

89.  The  objective  of  the  reliability  computations  for  the  dynamic 
models  is  to  determine  the  reliability  after  n  cycles  or  occurrences  of 
loading  R^  ,  i.e.,  the  probability  of  not  having  a  failure  on  any  one  of  the 
n  cycles  or  loadings. 

90.  Reliability  computations  for  dynamic  (time-dependent)  models  can  be 
made  for  deterministic  and  random  cycle  times.  The  loading  on  water  distribu¬ 
tion  systems  can  be  deterministic  under  normal  loading  conditions  and  random 
under  emergency  loading  conditions.  The  model  for  deterministic  cycles  will 
be  developed  which  naturally  leads  to  the  model  for  random  cycle  times.  For 
deterministically  known  cycle  times,  the  reliability  at  time  t  ,  R(t)  ,  can 
be  determined  from  the  reliability  after  n  cycles  or  occurrences  of  loading 

R  as 
n 


R  ;  t  <  t  <  t 
n  n  n+1 


in  which  tn  -  the  instant  in  time  at  which  the  nth  cycle  occurs.  Referring 
to  the  loading  as  x  and  the  resistance  as  y  ,  Rr  can  be  expressed  as 


*n  "  pKxi  <  y)n<x2  <  y)n...n(xn  <  y)] 


R  -  P[ (max(x. ,x„. . ,x  )  <  y] 
n  1  c.  n 


By  letting  the  maximum  loading,  x  =  max(x. ,x„ . . ,x  )  ,  the  distribution 

®  max  12  n 

F  (x)  of  x  is 
n  max 


F  (x)  =  [F  (x)]‘ 
n  x 


provided  the  loadings  are  independent  and  identically  distributed,  in  which 
F  (x)  is  the  cumulative  distribution  of  the  loadings  or  hydrologic  events 
which  is  determined  from  integrating  the  probability  density  function  of  the 
loadings. 


91.  For  the  time-dependent  reliability  model  for  deterministic  cycles, 
the  reliability  is  expressed  as 


n  ‘  f  fy(y)  /  fx(x)  dx  n  dy  “  f  f  (uMFx(y)ln  dy 


(100) 


The  preceding  reliabilities  for  random  loading  times  can  be  expressed  in  terms 
of  time  as 


R(t)  «=  /  n  (t)R 
J  n  n  n 
n=0 


(101) 


where 

( t )  “  probability  of  n  loadings  occurring  in  the  time  interval  [0,t] 

*  probability  of  all  n  successes 

It  is  now  evident  that  the  case  of  deterministic  cycle  times  is  a  special  case 
of  the  preceding  reliability  equation  for  random  cycle  times. 

92.  A  Poisson  distribution  can  be  used  to  describe  the  probability  of 
the  number  of  events  occurring  in  a  given  time  interval,  given  as 


"n(t)  = 


-at,  . n 
e  (at) 

n ! 


(102) 


in  which  a  =  the  mean  rate  of  occurrence  of  the  loading  which  may  be 

estimated  from  historical  data.  For  example,  if  annual  data  are  being  used, 

a  =  1/T  in  which  T  is  the  return  period.  The  mean  and  variance  are  both 
r  r 

at  where  t  can  be  considered  as  the  expected  service  life  of  the  hydraulic 
structure.  Other  distributions  may  also  be  applicable  but  they  lead  to  more 
complicated  analysis. 

93.  For  the  random  independent  loading  and  random  fixed  resistance,  the 
time-dependent  reliability  can  be  expressed  as: 


PART  IV:  SYSTEM  RELIABILITY  ANALYSIS 


Simple  Systems 

96.  Most  systems  are  composed  of  several  subsystems.  The  reliability 
of  the  system  depends  on  how  the  components  are  interconnected.  Several 
methods  for  computing  system  reliability  are  presented  below. 

Series  systems 

97.  The  simplest  type  system  is  a  series  system  in  which  every  com¬ 
ponent  must  function  if  the  system  is  to  function.  Considering  the  random 
variable  of  the  time  of  failure  as  t^  for  the  i-th  component,  then  for  a 
system  of  n  components,  the  system  reliability  is 

n  n 

R  (t)  =  n  P(t,  >  t)  =  n  R(t)  =  (R.)(R,)...(R  )  (105) 

s  i=l  1  i=l  1  2 


where  R^(t)  is  the  reliability  for  the  i-th  component.  For  a  system  that 
has  failure  times  exponentially  distributed  (with  constant  failure  rates)  so 
that  the  i-th  component  reliability  is  e  * it  then  the  system  reliability  is 


Rs(t) 


The  MTTF  is 


(106) 


MTTF 


/ 


exp 


^-IAt  dt 


1 


n 


E 

i=l 


(107) 


As  an  example  of  the  series  system,  consider  two  different  pumps  in  series, 
both  of  which  must  operate  to  pump  the  required  quantity.  The  constant  fail¬ 
ure  rates  for  the  pumps  are  Aj  =  0.0003  failures/hr  and  A^  =  0.0002 
failures/hr  .  For  a  2,000-hr  mission  time,  the  system  reliability  is: 

R  (t)  *  e"(0. 0003+0. 0002) (2000)  =  0>904g4  (108 

s 
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and  the  MTTF  is 


MTTF  “ 


_ 1 _ 

0.0003  +  0.0002 


2,000  hr 


(109! 


Chain  series  system 

98.  A  series  chain  model  is  a  series  system  such  that  if  any  one  com¬ 
ponent  fails,  the  system  will  fail.  This  model  is  based  on  the  idea  of  a 
chain  composed  of  n  links  where  the  chain  will  break  if  the  applied  stress 
x  exceeds  the  strength  y  of  any  one  link.  This  model  is  also  referred  to 
as  a  weakest  link  model.  The  system  reliability  is  then 

R  ■  min  R.  (110 

s  ±  1 

The  reliability  for  any  one  link  is 

R  =  P(x  >  y)  -  /  /  f  (x)f  (y)  dy  dx  (111 

1  0  y  X  y 

where 

f(x)  *  probability  distribution  function  for  the  strength 
f(y)  •  probability  distribution  function  for  the  stress 

Parallel  system 

99.  A  parallel  system  is  defined  as  one  which  will  fail  if  and  only  if 
all  units  in  the  system  fail  or  malfunction.  The  pure  parallel  system  is  one 
in  which  all  components  are  initially  activated,  and  any  component  can  main¬ 
tain  the  system  operation.  The  system  reliability  is  then  expressed  as 

n 

R  (t)  -  1  -  n  [1  -  R.(t)J  (112 

8  i-1  1 

For  a  system  with  exponentially  distributed  time  to  failure  and  a  constant 
failure  rate  for  each  component  of  the  system,  the  system  reliability  is 


(113) 


n  -A  t 

R  (t)  =  1  -  n  (1  -  e  ) 
S  i =1 


and  the  MTTF  for  a  system  with  identical  components  is 


i  y  i 
a  ^  i 


(114) 


As  an  example  of  a  parallel  system,  consider  two  identical  pumps  operating  in 
a  redundant  configuration  so  that  either  pump  could  fail  and  the  peak  dis¬ 
charge  could  still  be  delivered.  Both  pumps  have  a  failure  rate  of 
A  =  0.0005  and  both  pumps  start  operating  at  t  =  0  .  The  system  reliability 
for  a  mission  time  of  t  =  1,000  hr  is 


-A  -2A 

R  (t)  =  2e  t  -e  C 


.  -(0.0005) (1000)  -2(0.0005) (1000) 

Ze  -  e 


(115) 


=  1.2131  -  0.3679  =  0.8452 


The  MTTF  is 


HTTF  -  I  (l  +  l)  ■  I  T  -  >-5  (does) 


=  3,000  hr 


(116) 


Standby  redundancy 

100.  A  standby  redundant  system  is  a  parallel  system  in  which  only  one 
component  or  subsystem  is  in  operation.  If  the  operating  component  fails, 
then  another  component  is  operated.  This  type  of  system  is  different  than  the 
parallel  network  where  all  the  components  are  operating  because  standby  units 
do  not  operate.  The  system  reliability  for  a  system  with  n  +  1  components 
in  which  one  component  is  operating  and  n  units  are  on  standby  until  the 
operating  unit  fails,  is  given  by 


».(«)  '  L  ^77 


1  -At 
e 


(117) 
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This  assumes  the  following:  the  switching  arrangement  is  perfect,  the  units 
are  identical,  the  component  failure  rates  are  constant,  the  standby  units  are 
as  good  as  new,  and  the  unit  failures  are  statistically  independent.  For 
n  +  1  nonidentical  components  with  different  failure  time  density  functions, 
the  system  reliability  is 

CD 

R  (t)  =  f  f  (t)  dt  (118) 

S  t 

where  f  (t)  is  the  standby  redundant  system  failure  density  given  by 
st 

t  y  y , 

fst(t)  =  /  /  '*•/  fl(yl)f2(y2  -  yl)”*ftt+l(t  ‘  yn}  dyl  dy2-*-dyn  (U9) 

O  O  O 

As  an  example  of  a  standby  redundant  system,  assume  an  exponential  failure 
distribution  and  consider  two  identical  pumps,  one  operating  and  the  second  on 
standby,  with  identical  failure  rates  of  X  *  0.0005  failures/hr.  The  standby 
unit  is  as  good  as  new  at  time  t  =  0  .  The  system  reliability  for 
t  “  1,000  hr  is 

R  (t)  «  (1  +  Xt)e"At  -  (1  +  0.0005)e“(0,0005)(10°0)  «  0.6068  (120) 

st 

101.  A  k-out-of-n  system  is  a  system  in  which  a  specified  number  of 
subsystems  must  be  good  for  system  success.  The  binomial  distribution  is  used 
to  define  the  system  reliability  for  k-out-of-n  of  independent  and  identical 
units  given  by 


For  a  constant  failure  rate  the  reliability  is  expressed  as 


-••t  1  n  -  e  ‘V 

1 1  -AM 

(  1  -  e  ) 


(122) 


As  an  example  of  a  k-out-of-n  system,  consider  a  pumping  system  with  three 
pumps,  one  of  which  is  standby,  all  with  constant  failure  rates  of  X  =  0.0005 
failures/hr.  The  system  reliability  for  t  =  1,000  hr  is 


R2/3  C  t)  =  3e 


-(2) (0.0005) (1000) 


-(3) (0.0005) (1000) 


(123) 


=  1.1036  -  0.4463  =  0.6573 


Complex  Systems 


102.  As  shown  in  the  previous  section,  the  reliability  of  series- 
parallel  systems  is  generally  straightforward.  In  most  practical  situations, 
such  as  water  distribution  systems,  the  system  (network)  has  a  nonseries- 
parallel  configuration  and  the  evaluation  is  much  more  difficult.  There  have 
been  many  techniques  developed  for  system  reliability  evaluation.  A  great 
deal  of  work  has  been  done  on  state  enumeration  methods  (event-space  methods), 
network  reduction  methods,  and  path  enumeration  methods.  A  brief  summary  is 
provided  of  each  of  these  methods. 

State  enumeration  methods 

103.  This  method  lists  all  possible  mutually  exclusive  states  of  the 
system.  A  state  is  defined  by  listing  the  successful  and  failed  elements  in 
the  system.  For  a  system  with  n  elements  or  components,  in  general  there 
are  2n  states,  so  that  a  system  with  10  components  would  have  1,024  states. 

The  states  which  result  in  successful  system  operation  are  identified  and  the 
probability  of  occurrence  of  each  successful  state  is  computed.  The  last  step 
is  to  sum  all  the  successful  state  probabilities  which  give  the  system  relia¬ 
bility.  This  method  can  be  computationally  infeasible  for  systems  having  a 
large  number  of  components  (Brown  1971). 

Network  reduction  methods 

104.  These  methods  combine  the  series,  parallel,  and  series-parallel 
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subsystems  until  a  nonseries-parallel  system  which  cannot  be  further  reduced 
is  obtained.  Factoring  theorems  are  then  used  to  obtain  system  reliability. 

A  component  A  is  selected,  and  two  networks  are  obtained  and  generated  when 
A  is  replaced  by  a  short-circuit  (perfect  competition)  and  an  open  circuit. 

If  the  two  networks  are  simple  series-parallel,  they  can  be  reduced;  other¬ 
wise,  the  next  block  A  must  be  selected  and  the  procedure  is  repeated. 
Further  discussions  of  network  reduction  methods  can  be  found  in  Moscowitz 
(1958),  Buzacott  (1970),  Banerjee  and  Rajamani  (1972),  and  Misra  (1972). 

Path  enumeration  methods 

105.  Path  enumeration  methods  are  very  valuable  tools  for  system  reli¬ 
ability  evaluation.  A  path  is  a  set  of  elements  (components)  which  form  a 
connection  between  input  and  output  when  traversed  in  a  stated  direction.  A 
minimal  path  is  one  in  which  no  node  is  traversed  more  than  once  in  going 
along  the  path.  The  i-th  minimal  path  will  be  denoted  as  P^  ,  i  =  l...m  . 
Assuming  any  path  is  operable,  the  system  performs  adequately,  then  the  system 
reliability  is 


R  -  P 


m 


U  p, 

i*=l 


(124) 


where  P^  [  ]  represents  probability  and  U  denotes  the  union. 

106.  A  cut  set  is  defined  as  a  set  of  elements,  which  if  it  fails, 
causes  the  system  to  fail  regardless  of  the  condition  of  the  other  elements  in 
the  system.  A  minimal  cut  is  one  in  which  there  is  no  proper  subset  of  ele¬ 
ments  whose  failure  alone  will  cause  the  system  to  fail.  In  other  words  a 
minimal  cut  is  such  that  if  any  component  is  removed  from  the  set,  the  remain¬ 
ing  elements  collectively  are  no  longer  a  cut  set.  The  minimal  cut  sets  are 
denoted  as  ,  i  =  l...m  and  C^,  denotes  the  complement  of  ,  i.e.  the 
failure  of  all  elements  of  the  cut  .  The  system  reliability  is 


R 


(125) 


107.  A  basic  algorithm  for  the  path  enumeration  method  can  be  stated  as 
(Henley  and  Gandhi  1975): 
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a.  Find  all  minimal  paths  using  the  reliability  graph.  Several 
computer  codes  have  been  developed  for  this  purpose  which  are 
discussed  in  a  later  section. 

b.  Find  all  required  unions  of  the  paths. 

c.  Give  each  path  union  a  reliability  expression  in  terms  of 
module  reliability. 

d.  Use  the  following  equation  expressing  the  system  reliability 
in  terms  of  module  reliabilities. 


where  the  members  of  the  i-th  path  are  denoted  as  1  e  P^,  the  union  of  the 
i-th  and  j-th  paths  are  denoted  by  1  e  P^U  V  etc- 

Review  of  system  reliability 
evaluation  techniques  for  complex  systems 

108.  Hwang,  Tillman,  and  Lee  (1981)  presented  a  review  of  literature 
related  to  system  reliability  evaluation  techniques  for  small  to  large  complex 
systems.  A  large  system  was  defined  as  one  which  has  more  than  ten  components 
and  a  moderate  system  as  one  which  has  more  than  six  components  and  less  than 
ten.  Complex  systems  were  defined  as  ones  which  could  not  be  reduced  to  a 
series-parallel  system. 

109.  Hwang,  Tillman,  and  Lee  concluded  that  for  a  large  complex  system 
computer  programs  should  be  used  that  provide  the  set  of  minimal  cuts  and  cal¬ 
culate  the  minimal  cut  approximation  to  system  reliability.  Minimal  paths  can 
be  generated  from  minimum  cuts.  Based  on  minimal  path  (cut)  sets,  reliability 
approximations  can  then  be  obtained  for  large  complex  networks.  Hwang,  Till¬ 
man,  and  Lee  also  noted  that  Monte  Carlo  methods  for  system  reliability  eval¬ 
uation  were  used  when  component  reliabilities  are  sampled  by  the  Monte  Carlo 
method.  Hwang,  Tillman,  and  Lee  also  identified  several  miscellaneous 
approaches  for  evaluating  complex  systems  including  a  moment  method,  a  block 
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diagram  method,  Bayesian  decomposition,  and  decomposition  by  Boolean 
expression. 

110.  Hwang,  Tillman,  and  Lee  (1981)  concluded  that  of  all  the  evalua¬ 
tion  techniques  in  the  papers  surveyed  only  a  few  had  limited  success  in  solv¬ 
ing  some  large  complex  system  reliability  problems  and  few  techniques  have 
been  completely  effective  when  applied  to  large  system  reliability  problems. 
They  felt  that  a  new  direction  should  be  to  develop  a  generally  efficient 
graph  partitioning  technique  for  reliability  evaluation  of  large,  highly 
interconnected  networks. 

111.  Since  the  1981  paper  of  Hwang,  Tillman,  and  Lee,  several  other 
system  reliability  evaluation  techniques  have  been  reported  in  the  literature. 
Aggarwal ,  Chopra,  and  Bajwa  (1982)  presented  a  method  that  uses  decomposition 
of  a  probabilistic  graph  using  cut  sets.  The  method  is  applied  to  a  simpli¬ 
fied  network  with  five  nodes  and  seven  links  and  only  limited  computational 
results  are  presented. 

112.  Bennetts  (1982)  presents  a  method  for  the  analysis  of  reliability 
block  diagrams  using  Boolean  algebra  techniques.  The  method  is  based  on  an 
analysis  of  path  sets  derived  from  reliability  block  diagrams.  Boolean  meth¬ 
ods  are  applied  to  each  path  so  that  the  component  reliability  parameters  are 
considered  to  be  Boolean  variables  rather  than  probabilistic  variables  and  the 
whole  problem  is  treated  in  a  Boolean  framework.  Hagstrom  (1983)  presents  a 
model  using  decomposition  trees  of  a  network  based  upon  finding  and  analyzing 
triconnected  components  of  the  network. 

113.  Deuermeyer  (1982)  presented  an  interesting  approach  to  network 
reliability  analysis  of  flow  networks  that  is  based  upon  developing  network 
functions.  A  network  function  specifies  the  maximum  flow  deliverable  by  the 
network  while  in  a  specific  state.  The  maximum  flow  problem  can  be  repre¬ 
sented  as  a  linear  programming  problem  in  which  the  objective  is  to  maximize 
flow.  The  probability  distribution  of  maximum  flow  can  then  be  determined  and 
used  as  an  index  of  reliability. 

114.  Touey  (1983)  presented  a  new  algorithm  for  computing  network  term¬ 
inal  reliability  from  a  set  of  paths  or  cut  sets.  This  algorithm  is  based  on 
selective  generation  of  relevant  states  by  way  of  methods  for  choosing  and 
pruning  branches  of  a  binary  tree.  The  author  states  that  the  method  is  easy 
to  implement  and  to  understand,  and  has  proven  in  practice  to  be  more  effi¬ 
cient  than  the  fastest  methods  published. 


Multistate  systems 

with  multi-state  components 

115.  Hudson  and  Kapur  (1982)  present  models  for  reliability  analysis  to 
systems  which  can  have  a  range  of  states  and  all  of  its  components  can  also 
have  a  range  of  multiple  states.  Such  systems  generally  have  various  levels 
of  operational  performance  so  that  the  total  system  effectiveness  measures 
reflect  all  the  performance  levels  and  their  reliabilities.  Binary  system 
theory  requires  that  each  component,  as  well  as  the  entire  system,  be  consid¬ 
ered  either  functioning  or  failed.  Multistate  approaches  allow  states  of  par¬ 
tial  failure  for  both  the  system  and  its  components.  The  advantage  is  that 
either  standby  or  active  redundancy  can  be  considered.  The  methodology  pre¬ 
sented  in  Hudson's  and  Kapur's  paper  is  illustrated  by  a  simple  example  of  a 
domestic  hot  water  system  consisting  of  components  representing  a  gas  fired 
subsystem,  a  solar  collector-controller,  two  pumps,  and  a  solar  piping  and 
storage  subsystem. 

116.  This  type  of  approach  seems  to  be  in  the  developmental  stages  and 
may  be  a  little  premature  for  application  to  water  distribution  systems.  How¬ 
ever,  once  the  technology  is  developed,  this  should  prove  to  be  very  promis¬ 
ing.  Earlier  work  on  the  multistate  (discrete  state)  point  of  work  was 
reported  by  Dhillon  (1975),  Murchland  (1975),  Barlow  and  Wu  (1978),  and 
El-Neweihi,  Proschan,  and  Sethuraman  (1978). 

Fault  Tree  Analysis 


117.  Fault  tree  analysis  has  been  proposed  as  a  method  for  evaluating 
the  reliability  of  systems.  A  fault  tree  is  a  logical  diagram  representing 
the  consequences  of  the  component  failures  (basic  or  primary  failures)  on 
system  failure  (top  failure).  Dhillon  and  Singh  (1981)  defined  the  advantages 
and  disadvantages  of  the  fault  tree  analysis  technique.  Advantages  include: 

a.  Provides  insight  into  the  system  behavior. 

b.  Requires  the  reliability  analyst  to  understand  the  system 
thoroughly  and  deal  specifically  with  one  particular  failure 
at  a  time. 

c.  Helps  to  ferret  out  failures  deductively. 

d.  Provides  a  visibility  tool  to  designers,  users,  and  management 
to  justify  design  changes  and  trade-off  studies. 
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e.  Provides  options  to  perform  quantitative  or  qualitative  reli¬ 
ability  analysis. 

f .  Technique  can  handle  complex  systems  more  easily. 

Disadvantages  include: 

a.  Can  be  costly  and  time-consuming. 

b.  Results  can  be  difficult  to  check. 

c.  Technique  normally  considers  that  the  system  components  are  in 
either  working  or  failed  state;  therefore,  the  partial  failure 
states  of  components  are  difficult  to  handle. 

d.  Analytical  solution  for  fault  trees  containing  standbys  and 
repairable  components  are  difficult  to  obtain  for  the  general 
case . 

e.  To  include  all  types  of  common-cause  failure  requires  con¬ 
siderable  effort. 

118.  Another  advantage  not  mentioned  by  Dhillon  and  Singh  (1981)  is 
that  commercial  codes  are  available  to  perform  the  analysis. 

Fault  tree  construction 

119.  Before  constructing  a  fault  tree,  the  analyst  must  thoroughly 
understand  the  system  and  its  intended  use.  One  must  determine  the  higher 
order  functional  events  and  continue  the  fault  event  analysis  to  determine 
their  logical  relationship  with  lower  level  events.  Once  this  is  accomp¬ 
lished,  the  fault  tree  can  be  constructed.  A  brief  description  of  fault  tree 
construction  is  presented  below.  The  basic  concepts  of  fault  tree  analysis 
are  presented  in  Henley  and  Kumamoto  (1981)  and  Dhillon  and  Singh  (1981). 

120.  The  major  objective  of  fault  tree  construction  is  to  represent  the 
system  condition,  which  may  cause  system  failure,  in  a  symbolic  manner.  In 
other  words  the  fault  tree  consists  of  sequences  of  events  that  lead  to  the 
system  failure.  These  sequences  of  events  are  represented  by  AND,  OR,  or 
other  logic  gates.  There  are  actually  two  types  of  building  blocks:  gate 
symbols  and  event  symbols. 

121.  Gate  symbols  connect  events  according  to  their  casual  relation 
such  that  they  may  have  one  or  more  input  events  but  only  one  output  event. 
Table  9  lists  the  various  gate  symbols  (Henley  and  Kumamoto  1981).  The  AND 
gate  denotes  that  an  output  event  occurs  if  and  only  if  all  the  input  events 
occur.  The  OR  gate  is  an  intermediate  event  which  denotes  that  there  is  no 
output  unless  one  and  only  one  of  the  input  events  occurs.  The  priority  AND 
gate  is  logically  equivalent  to  an  AND  gate  with  the  exception  that  the  input 
events  must  occur  in  a  specific  order.  The  inhibit  gate  produces  output  only 


when  the  conditional  input  is  satisfied  and  is  logically  equivalent  to  an  AND 
gate  with  two  input  events. 

122.  Event  symbols  are  shown  in  Table  10.  A  fault  event,  denoted  by  a 
rectangular  box,  results  from  a  combination  of  more  basic  faults  acting 
through  logic  gates.  A  circle  denotes  a  basic  component  failure  that  repre¬ 
sents  the  limit  of  resolution  of  a  fault  tree.  A  diamond  represents  a  fault 
event  whose  causes  have  not  been  fully  developed.  The  house-shaped  event 
denotes  a  fault  event  which  is  expected  to  occur.  A  triangle  denotes  a  trans¬ 
fer  IN  or  OUT  and  is  used  to  avoid  repeating  sections  of  the  fault  tree. 

123.  There  are  two  approaches,  forward  analysis  and  backward  analysis, 
for  analyzing  causal  relations.  Forward  analysis  starts  with  a  set  of  failure 
events  and  proceeds  forward,  looking  for  possible  consequences  resulting  from 
the  events.  The  backward  analysis,  which  is  used  in  fault  tree  analysis, 
begins  with  a  system  hazard  (failure)  and  traces  backward,  searching  for  pos¬ 
sible  causes  of  the  hazard. 

124.  Henley  and  Kumamoto  (1981)  present  heuristic  guidelines  for  con¬ 
structing  fault  trees  which  are  summarized  in  Table  11  and  Figure  9  and  are 
listed  on  the  following  page: 


Figure  9.  Development  of  a  component 
failure  or  state-of-component  event 
(Henley  and  Kumamoto  1981).  Reprinted 
by  permission  of  Prentice-Hall,  Inc., 


Englewood  Cliffs,  N.J.,  Rel iabi 1 ity 
Engineering  and  Risk  Assessment,  1981, 


a.  Replace  abstract  events  by  less  abstract  events. 

b.  Classify  an  event  into  more  elementary  events. 

c.  Identify  distinct  causes  for  an  event. 

d.  Couple  trigger  event  with  "no  protection  actions." 

e.  Find  cooperative  causes  for  an  event. 

Pinpoint  component  failure  events. 

£.  Develop  component  failure  using  Figure  9. 

125.  An  example  of  a  fault  tree  construction  is  given  for  the  system  in 
Figure  10.  In  this  pumping  system,  the  tank  is  filled  in  10  min  and  empties 
in  50  having  a  cycle  time  of  60  min.  After  the  switch  is  closed,  the  time  is 
set  to  open  the  contacts  in  10  min.  If  the  mechanism  fails  then  the  horn 
sounds  and  the  operator  opens  the  switch  to  prevent  pressure  tank  rupture. 

The  fault  tree  for  the  pumping  system  is  shown  in  Figure  11. 
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Figure  10.  Schematic  diagram  of  a  pumping 
system  (Henley  and  Kumamoto  1981). 
Reprinted  by  permission  of  Prentice-Hall, 
Inc.,  Englewood  Cliffs,  N.J.,  Reliability 
Engineering  and  Risk  Assessment,  1981, 
p  72 


Evaluation  of  fault  trees 

126.  The  basic  steps  used  to  evaluate  fault  trees  include: 

a.  Construct  the  fault  tree. 

b.  Determine  the  minimal  cut  sets. 
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Figure  11.  Fault  tree  for  pumping  system  shovm  in  Figure  10  (Henley 
and  Kumamoto  1981).  Reprinted  by  permission  of  Prentice-Hall,  Inc., 
Englewood  Cliffs,  N.J.,  Reliability  Engineering  and  Risk  Assessment, 

1981,  p  73 

c.  Develop  primary  event  information. 

d.  Develop  cut  set  information. 

e.  Develop  top  event  Information. 

127.  In  order  to  evaluate  the  fault  tree,  one  should  always  start  from 
the  minimal  cut  sets  which  in  essence  are  critical  paths.  Basically,  the 
fault  tree  evaluation  comprises  two  distinct  processes:  (a)  the  determination 
of  the  logical  combination  of  events  that  cause  top  event  failure  expressed  in 


minimal  cut  sets  and  (b)  the  numerical  evaluation  of  the  expression. 

128.  Cut  sets  are  collections  of  basic  events  such  that  if  all  these 
basic  events  occur,  then  the  top  event  is  guaranteed  to  occur.  The  path  set 
is  a  dual  concept  to  the  cut  set  in  that  it  is  a  collection  of  basic  events, 
and  if  none  of  the  events  in  the  set  occur,  then  the  top  event  is  guaranteed 
to  not  occur.  As  one  could  imagine,  a  large  system  has  an  enormous  number  of 
failure  modes.  A  minimal  cut  set  is  one  that  if  any  basic  event  is  removed 
from  the  set  then  the  remaining  events  collectively  are  no  longer  a  cut  set. 

By  the  use  of  minimal  cut  sets,  the  number  of  cut  sets  and  basic  events  are 
reduced  in  order  to  simplify  the  analysis.  Several  computer  codes  are  avail¬ 
able  for  generating  cut  sets  including  MOCUS  (Fussell,  Henry,  and  Marshall 
1974)  which  was  developed  to  obtain  minimal  cut  sets  from  fault  trees.  A  list 
of  available  computer  codes  for  reliability  analysis  is  found  in  Appendix  B. 

129.  The  system  availability  Ag(t)  is  the  probability  that  the  top 
event  does  not  exist  at  time  t  ,  which  is  the  probability  of  the  systems 
operating  successfully  when  the  top  event  is  an  OR  combination  of  all  system 
hazards.  System  unavailability  Qg(t)  is  the  probability  that  the  top  event 
exists  at  time  t  ,  which  is  .either  the  probability  of  system  failure  or  the 
probability  of  a  particular  system  hazard  at  time  t  .  The  system  availa¬ 
bility  and  system  unavailability  are  complementary,  i.e. 

A  (t)  +  Q  (t)  =  1  (127) 

s  s 

130.  System  reliability  Rg(t)  is  the  probability  that  the  top  event 
does  not  occur  over  time  interval  (0,t)  .  System  reliability  requires  con¬ 
tinuation  of  the  nonexistence  of  the  top  event  and  the  following  holds 

R  (t)  s  A  (t)  (128) 

s  s 

131.  The  system  unreliability  Fg(t)  is  the  probability  that  the  top 
event  occurs  before  time  t  and  is  complementary  to  the  system  reliability 


R  (t)  +  F  (t)  =  1 


(129) 
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The  system  failure  density  F  (t)  is  defined  as 

s 


(130) 


dFg (t) 


(131) 


132.  System  conditional  failure  intensity  Vg(t)  is  the  probability 
that  the  top  event  occurs  per  unit  time  at  time  t  given  that  it  does  not 
exist  at  time  t  .  The  system  unconditional  failure  intensity  Wg(t)  is  the 
probability  that  the  top  event  occurs  per  unit  time  at  time  t  .  The  expected 
number  of  top  events  during  time  interval  (t,t+dt)  is 


t+dt 

W  (t,t+dt)  =  /  w  (t)  dt 

s  _/  s 


(132) 


133.  The  mean  time  to  first  failure  is  the  expected  length  of  time  to 
the  first  occurrence  of  the  top  event  and  is  given  by 


MTTF  =  f  tf  (t)  dt 
s  Jq  s 


(133) 


134.  Considering  independent  basic  events  B^...Bn  ,  the  probability  of 
a  cut  set  occurrence  at  time  t,Q*(t)  is  obtained  from  the  intersection  of 


the  basic  events  as 


q* (t)  =  p  (B,nB„n...nB  )  =  n  Q.(t) 
r  1  2  n  j-1  3 


(134) 


where  n  is  the  number  of  cut  set  members  and  Qj(t)  is  the  probability  of 
the  j-th  basic  event  existing  at  time  t  .  A  cut  set  occurrence  is  when  all 
basic  events  in  the  cut  set  are  occurring.  The  asterisk  *  is  used  to  denote 
that  the  quantity  is  a  cut  set.  The  notation  Q(t)  refers  to  a  component 
unavailability,  Q*(t)  refers  to  the  cut  set  unavailability,  and  Qg(t) 
refers  to  the  system  unavailability. 


135.  The  probability  of  occurrence  of  a  cut  set  per  unit  time  at  time 
t  given  no  cut  set  failure  at  time  t  is  denoted  as  A*(t)  .  The  probabil¬ 
ity  that  the  cut  set  occurs  during  the  time  interval  (t,t+dt)  is 


A*(t)  dt  =  P[C*(t,t+dt)  C*(t)] 


Pr[C*(t,t+dt)] 

P[C*(t)] 


(135) 


where 

C*(t,t+dt)  =  occurrence  of  the  cut  set  during  (t,t+dt) 

C*(t)  =  the  nonexistence  of  the  cut  set  failure  at  time  t 

Henley  and  Kumamoto  (1981)  show  that  the  numerator  is  W*(t)  dt  so  that 


ii 

W  (t)  dt  n  Q  (t) 
1=1  2  k=l 


A* (t)  dt  = 


1  -  Q*(t) 


Each  term  in  the  summation  is  the  probability  of  the  j-th  basic  event  during 
(t , t+dt)  with  the  remaining  basic  event  existing  at  time  t  .  The  denomi¬ 
nator  is  the  probability  of  the  nonexistence  of  the  cut  set  failure  at  time 
t  . 

136.  The  term  W*(t)  is  the  expected  number  of  times  the  cut  set 
occurs  per  unit  time  at  time  t  defined  as 


W*(t)  = 


£  ”j(t)  ",  V‘> 


so  that 


A*  ( t ) 


w*(t) 

[1  -  Q*(t) ] 


137.  Similar  expressions  hold  for  u*(t)  and  v*(t) 


II 


I 


V 


t: 
».■  • 

£ 


k 


v*(t)  =  V  v  (t)  n 
j-i  J  k-i 


fi  -  Qk(t)j 


(139) 


and 


u*(t) 


v*(t) 

Q*(t) 


(140) 


138.  The  values  of  W*(0,t)  and  V*(0,t)  are 


W*(0,t)  “  J  w*(u)  du 
0 


V*(0,t)  “  f  v*(u)  du 
0 


(141) 


(142) 


139.  Henley  and  Kumamoto  (1981)  show  that  the  system  unavailability  can 
be  determined  using 
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where 
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c  i-1 

£  £  n  Q(t)  + 

i-2  J-l  i» j 


+  <-1)n'‘  .<1  SN  i  "l  Q<t) 

12  m  c  1  m 


N  -1 

+...+  (-1)  c  n  Q(t) 
ir**iN 


JI  *  product  of  Q(t)  for  the  basic  events  in  cut  set  i,  or  i^...  or 

i  1 
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N  *  total  number  of  minimal  cuts 
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The  lower  and  upper  bounds  for  Qg(t)  can  be  written  as  (Henley  and  Kumamoto 
1981)  : 


c  1-1 


C  Q*(t)  '  J]  J]  ”  Q(t)  *  Qs(t)  <  Qi(t) 

L— 1  1=2  j=l  1,J  1=1 


(144) 


where  II  refers  to  the  product  of  cut  sets  i  or  j  . 

140.  The  expected  number  of  times  the  top  event  occurs  at  time  t  ,  per 

unit  time,  is  w  (t)  .  Let  e.  be  the  event  that  the  i-th  cut  set  failure 
s  1 

occurs  at  time  t  to  t+dt  so  that  =  W  *(t)  dt  .  For  the  top  event 

to  occur  in  time  (t,t+dt)  ,  none  of  the  cut  set  failures  can  exist  at  time  t 
and  one  or  more  must  fail  during  the  time  t  to  t+dt  ,  so  that 


wg(t)  dt  =  Pr  I  A  |J 


(145) 


where  A  O  e  is  A  fl  U  e 
i-1  V  i=l  1 


D  e,  l=  the  event  that  one  or  more  of  the  cut  set  failures  occur 
i=l  J  at  time  t 

A  =  the  event  of  none  of  the  cut  sets  failures  existing  at 
time  t 


This  can  be  reduced  to  (Henley  and  Kumamoto  1981): 


W  (t)  =  -  W^2) (t) 

SS  S 


(146) 


where 


contribution  from  the  event  that  one  or  more  cut  sets 
fail  during  time  (t,t+dt) 

those  cases  in  which  one  or  more  cut  sets  fail  during 
(t,t+dt)  ,  while  the  other  cut  sets  that  have  already 
failed  to  time  t  have  not  been  repaired 


i  FI 'JI'  l'JMflr1 


l> 

s 


141.  Computer  programs  have  been  developed  to  compute  system  parameters 
(unavailability,  availability,  expected  number  of  failures  and  repairs,  and 
conditional  failure  and  repair  intensities)  given  minimal  cut  or  path  sets  of 
large  complicated  fault  trees.  KITT-1  (Vesely  and  Narum  1970)  applies  the 
above  concepts  of  kinetic  tree  theory.  The  program  handles  independent  basic 
events  which  are  either  repairable  or  nonrepairable  and  have  constant  failure 
rates  and  constant  repair  rates  u  .  Another  version  of  the  program,  KITT-2, 
allows  for  time-varying  failure  and  repair  rates.  A  later  version  called 
KITT-1T  (Ong  and  Henley  1980)  is  a  modified  version  of  KITT-1  to  include  time 
delays  provided  by  storage  tanks  and  component  (standby)  redundancy. 
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PART  V:  AVAILABILITY  AND  RELIABILITY  OF  WATER 
DISTRIBUTION  SYSTEM  COMPONENTS 


Background 

142.  As  discussed  in  previous  parts  of  this  report,  there  are  a  number 
of  techniques  available  for  evaluating  the  reliability  of  a  mechanical  system. 
Because  of  system  complexity,  these  procedures  have  not  been  generally  used  to 
evaluate  the  reliability  of  water  distribution  systems. 

143.  However,  simplifying  assumptions  can  be  made  that  enable  the  engi¬ 
neer  to  apply  these  procedures  to  water  distribution  system  components.  The 
following  sections  present  detailed  concepts  for  applying  these  procedures  to 
water  distribution  system  component  analysis. 

Pumps 

144.  Time  to  failure  analysis  can  be  applied  to  the  evaluation  of  pump¬ 
ing  systems.  For  the  sake  of  simplicity,  the  exponential  distribution  is  used 
to  illustrate  a  procedure  for  the  time  to  failure  analysis  of  a  pump  in  which 
the  pump  failure  data  are  lumped,  i.e.  failure  data  for  the  pumps'  individual 
subsystems  are  lumped  into  one  parameter.  A  more  detailed  analysis  in  which 
the  reliability  and  availability  of  the  individual  subsystem  is  presented  in 
the  following  section. 

145.  Damelin,  Shamir,  and  Arad  (1972)  presented  data  for  a  pump  indicat 
ing  an  MTTF  of  1,200  hr  and  a  MTTR  of  50  hr  for  a  * r|0-m3/hr-capacity  pump. 
Equations  54  and  56  can  be  used  to  calculate  values  for  failure  rate  A  and 
repair  rate  p  of  0.0008/hr  (7.3/year)  and  0.02/hr  (175.2/year),  respec¬ 
tively.  Table  12  presents  reliability,  unreliability,  and  availability  values 
for  these  values  of  A  and  p  . 

146.  The  reliability  of  a  system  is  the  probability  that  the  system 
experiences  no  failures  during  the  time  interval  (0,t)  .  The  reliability 
curve  for  A  =  0.0008  is  shown  in  Figure  12.  The  availability  of  a  system  is 
the  probability  that  the  system  is  operational  at  time  t  .  For  repairable 
systems,  availability  is  a  more  appropriate  measure  of  the  probability  that  a 
system  will  be  operational.  Availability  is  affected  by  both  the  MTTF  and 
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Figure  12.  Reliability  curve  for  a  pump  system  with 

A  =  0.0008/hr 


MTTR  .  Figure  13  presents  availability  curves  for  A  ■>  0.0008/hr  and 
p  =  0.02,  0.01,  and  0.005/hr  . 

147.  For  repairable  systems,  the  availability  is  always  greater  than  or 
equal  to  the  reliability.  This  concept  is  illustrated  graphically  in  Fig¬ 
ure  14  for  A  =  0.0008/hr  and  p  ■  0.01/hr  . 

148.  For  repairable  systems,  as  t  approaches  Infinity,  the  avail¬ 
ability  approaches  a  constant  value  greater  than  0  (stationary  availability). 

A  comparison  is  of  the  effect  of  both  p  and  A  on  the  stationary  availa¬ 
bility  and  unavailability  (Figure  15). 

Pumping  Stations 


Component  description 

149.  Pumping  stations  are  one  of  the  major  components  of  a  water  dis¬ 
tribution  system.  A  pumping  station  consists  of  one  or  more  pumping  units  or 
systems  supported  by  appropriate  electrical,  piping,  and  structural  systems. 
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Figure  13.  Availability ' curves  for  a  pump  system  with  A  *  0.0008/hr 
and  U  *  0.02,  0.01,  and  0.005/hr 


A  typical  pumping  station  is  illustrated  schematically  in  Figure  16.  The 
pumping  unit  is  the  primary  system  within  the  pumping  station  and  includes 
five  major  subsystems:  pump,  motor  (driver),  power  transmission,  valves,  and 
controls.  The  evaluation  of  pumping  station  reliability  is  a  three-step  pro¬ 
cess  incorporating  the  analysis  of  subsystem  reliability,  system  reliability, 
and  finally  the  reliability  of  the  pumping  station  component  of  the  water  dis¬ 
tribution  system.  The  reliability  and  availability  evaluation  can  be  con¬ 
ducted  to  any  desired  level  of  detail  for  which  data  are  available.  For 
purposes  of  discussion  and  illustration,  the  evaluation  technique  presented 
below  will  evaluate  the  pumping  station  as  being  composed  of  four  major  sys¬ 
tems:  pumping  system,  electrical  system,  piping  system,  and  structural 
system.  The  pumping  system  will  be  further  divided  into  the  above  listed  five 
major  pumping  unit  subsystems. 


Design  guidance 


150.  A  common  problem  in  the  design  and  operation  of  water  treatment 


and  distribution  systems  is  deciding  the  number  and  sizes  of  pumps  to  be 


68 


A(t)  FOR  X  =  0.0008 

F  =  0.02 
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Figure  14.  Comparison  of  reliability  and  availability 
for  A  =  0.0008/hr  and  W  =  0.02/hr 


installed  in  a  pumping  station.  Traditionally,  this  has  been  accomplished 
using  various  rules  of  thumb  or  good  engineering  judgment.  Little  attention 
has  been  given  to  the  design  of  pumping  stations  based  on  the  concepts  of 
system  reliability  or  availability.  The  selection  of  the  number  of  pumps 
usually  depends  on  the  size  of  the  installation  and  the  engineer's  preference. 
The  most  common  design  basis  is  the  concept  of  firm  pumping  capacity.  Firm 
capacity  is  defined  as  the  pumping  capacity  of  the  facility  with  the  largest 
pump-out  operation.  The  Great  Lakes-Upper  Mississippi  River  Board  of  State 

Sanitary  Engineers  (1972)  requires  that  at  least  two  pumping  units  be  provided 

except  where  ample  time  will  be  available  between  pumping  periods  for  neces¬ 
sary  repairs.  If  only  two  units  are  provided,  each  unit  is  required  to  have 
sufficient  capacity  to  supply  the  peak  demand.  When  more  than  two  pumping 

units  are  provided,  the  capacity  of  the  pumps  should  be  sufficient  to  meet 

peak  demand  with  one  pump  out  of  service.  Sanks  (1978)  suggests  that  the 
design  capacity  of  a  pumping  facility  be  based  on  the  maximum  daily  flow  rate 
with  peak  hourly  rates  supplied  from  system  storage.  Sanks  suggests  use  of 


the  firm  pumping  capacity  concept  and  indicates  that,  for  small  and  medium 
size  water  systems,  providing  firm  design  capacity  with  the  largest  unit  out 
of  service  gives  a  reasonable  degree  of  reliability.  No  guidance  is  provided 
on  the  selection  of  an  appropriate  number  of  pumps.  Quantitative  guidance 
concerning  the  number  of  pumps  to  be  selected  is  generally  lacking. 

Subsystem  reliability  and  availability 

151.  The  first  step  in  analyzing  the  reliability  and  availability  of 
the  pumping  station  is  to  determine  the  reliability  and  availability  of  the 
individual  subsystems  within  the  pumping  station.  Assuming  an  exponential 
failure  distribution,  a  constant  failure  rate,  and  a  constant  repair  rate, 
reliability  and  availability  of  the  pumping  system  subsystems  can  be  evaluated 
analytically.  Using  the  data  developed  by  Shultz  and  Parr  (1981),  reliability 
and  availability  for  the  various  pumping  unit  subsystems  are  presented  in 
Table  13. 

Pumping  system 
reliability  and  availability 

152.  Once  the  reliability  of  the  individual  subsystems  has  been  com¬ 
puted,  the  reliability  of  the  individual  pumping  units  can  be  determined.  If 
it  is  assumed  that  each  of  the  subsystems  must  be  in  operating  condition,  the 
pumping  unit  can  be  evaluated  as  a  simple  series  Reliability  system.  From 
reliability  theory,  the  reliability  of  a  series  system  can  be  computed  using 
Equation  104.  Thus,  the  reliability  of  the  individual  pumping  units  may  be 
calculated  as  follows. 

R  -  (RVO  ) (RVO  ) (RVO  ) (RV0DT) (RVO  )2  (147) 

s  p  m  c  r  I  v 


where 

Rg  =  reliability  of  the  pumping  unit  system 

RVOp  =  reliability  of  the  pump 

RVO  =  reliability  of  the  motor 

RVO^  *  reliability  of  the  control 

RV0pT  *  reliability  of  the  power  transmission 

RVO  =  reliability  of  the  valves  (note  one  valve  on  intake  and  one 
valve  on  discharge) 


Thus,  pumping  unit  reliability  for  a  10,000-hr  mission  life  is  calculated  as 
follows: 


R  =  (0.732086) (0.860773) (0. 755224) (0.887235) (0.500313) 2 
s 


048) 


R  =  0.105694 
s 

For  pumping  units,  reliability  is  often  measured  in  terms  of  operational 
availability,  which  is  a  measure  of  the  fraction  of  calendar  time  during  which 
the  pumping  unit  was  available  for  service.  Using  the  data  collected  by 
Shultz  and  Parr  (1981)  and  assuming  that  the  unavailabilities  of  the  indi¬ 
vidual  pumping  unit  subsystems  do  not  intersect,  a  conservative  estimate  of 
pumping  unit  availability  can  be  calculated  in  a  manner  similar  to  that  for 
reliability  using  Equation  104.  Thus,  for  a  10,000-hr  mission  time,  the 
availability  of  a  pumping  unit  can  be  calculated  as  follows: 


A  =  (0. 999703) (0. 999897) (0.99936) (0.999956) (0.999197) 2 

u 

A  =  0.997888 

b 


(149) 


This  indicates  that  the  pumping  unit  will  be  available  for  9,979  hr  out  of  the 
10,000-hr  mission  life,  or  conversely,  during  the  mission  life  the  pumping 
unit  will  be  out  of  service  for  approximately  21  hr. 

Pumping  station 
reliability  and  availability 

153.  The  results  of  the  system  reliability  and  availability  evaluation 
are  used  to  analyze  component  reliability  and  availability.  The  component  is 
the  pumping  station  in  its  entirety.  The  pump  station  is  essentially  a 
k-out-of-n  reliability  system.  The  technique  for  evaluation  of  pump  station 
availability  can  best  be  illustrated  through  use  of  an  example  computation. 

154.  A  common  design  practice  is  to  install  sufficient  pumps  to  handle 
peak  flows  and  include  a  spare  pump  of  equal  size  to  accommodate  any  downtime 
of  the  other  pumps.  Thus,  the  capacity  of  any  pump  (assuming  equal  pump 
sizes)  can  be  calculated  using  the  following  formula: 


s 


Q  •  capacity  of  the  individual  pumping  units 
Q^j,  =  design  flow  (peak  flow) 
n  “  number  of  pumping  units  in  the  pumping  stations 

155.  For  any  installation,  there  are  n  +  1  possible  capacity  states, 
i.e.  n  +  1  possible  pumping  capacities.  The  individual  pumping  units  have 
two  possible  operational  states:  available  and  unavailable.  Letting  A 
represent  the  percentage  of  time  that  a  pumping  unit  is  available  and  Q 
represent  the  percentage  of  time  that  a  pumping  unit  is  not  available  (note 
that  Q  *  1  -  A  ) ,  the  percentage  of  time  that  a  pumping  station  is  in  each 
capacity  state  can  be  evaluated  using  Equation  120. 

156.  For  a  three-pumping-unit  installation,  we  obtain  the  following 
equation: 


(151) 


Each  term  in  the  above  equation  represents  the  proportion  of  time  that  the 
system  will  be  at  a  respective  capacity  state.  For  example,  the  term 

A^Q^  yields  the  percentage  of  time  that  three  pumps  are  operational.  The 
state  capacity  is  obtained  by  multiplying  the  ‘number  of  available  pumps  by  the 
individual  unit's  pumping  capacity.  Thus,  for  a  three-pump  installation,  the 
following  state  time  proportion  matrix  can  be  calculated: 


State 


Time  ProDortion  Hours 


0.993653  9936.53 


0.006333 


63.33 


0.000013 


0,000001 

1.000000 


0.01 

10,000.00 


The  design  mission  life  is  10,000  hr.  The  estimated  number  of  hours  that  the 
pumping  station  is  not  expected  to  be  capable  of  pumping  at  the  combined  rate 
of  Qt  is  10,000  -  9999.86  or  0.14  hr.  For  practical  purposes,  such  a  system 
appears  to  be  100-percent  available  to  pump  the  design  flows. 


Water  Distribution  Piping 


Failure  rates 


157.  Regression  equations  can  be  developed  for  the  break  rates  of  water 
mains  using  data  from  specific  water  distribution  systems.  As  an  example, 
Walski  and  Pelliccia  (1982)  developed  break  rate  regression  equations  for  the 
Binghamton,  N.Y. ,  system  (Figure  17).  These  equations  are: 


Pit  Cast  Iron:  N(t)  *  0.02577  e 


0.0207(t-k) 


(152) 


Sandspun  Cast  Iron:  N(t)  *  0.0627  e 


0. 0137(t-k) 


(153) 


where 

N(t)  »  break  rate,  breaks/mile/year 
t  »  year 

k  =  year  of  pipe  installation 

Walski  and  Pelliccia  (1982)  state  that  because  of  such  factors  as  the  severity 
of  different  winters,  soil  conditions,  and  construction  techniques,  the  break 
rate  may  vary  significantly  between  systems  and  even  from  year  to  year  within 
a  given  system. 

158.  Walski  and  Pelliccia  (1982)  also  developed  a  regression  equation 
for  the  time  required  to  repair  pipe  breaks. 


t  -  6.5  d 


0.285 


(154) 


where 

t  -  time  to  repair,  hr 
d  *  pipe  diameter,  in. 

159.  A  study  of  the  Philadelphia,  Pa.,  water  system  (Weiss  et  al.  1985) 
developed  the  time  to  repair  valves  tabulated  below. 


.  vj  --r.  -r.  -r:'-'.  »p.  i". 


Pipe  Diameter  MTTR 

_ in. _  hr 

6  8.717 

8  9.079 

10  11.746 

12  16.460 

16  &  larger  24.360 


Figure  17.  Break  rate  curves  for  pit  cast  iron  and  sandspun 

cast  iron  pipes 


Reliability  and  availability 

160.  Techniques  for  evaluating  the  reliability  and  availability  of 
water  mains  can  best  be  illustrated  through  use  of  a  simple  example.  Consider 
a  5-mile  water  main  of  sandspun  cast  iron  pipe.  From  Equation  152,  the  break 
rate  per  year  (failure  rate)  can  be  calculated  as  follows: 

5  miles  x  N(t)  =  5  x  0.0627  g0-0137^-1^  =  0.3185  g0-0137^"1^  (155) 

161.  The  reliability  for  the  5-mile  water  main  can  be  computed  using 
Equation  65  as  follows: 
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R(t)  =  exp 


-  f  (0.3185  e 
J0 


0.0137c 


)  dt 


R(t)  =  exp  [  -23. 25  e 


0.0137c 


R(c)  =  exp  ^23.25  (1  -  e°,0137t)] 


162.  The  failure  densicy  f(c)  can  be  calculaCed  as: 


o.: 


in  hoc  0.0137c  0Ci  ,,  0.0137c. 

f(C)  =  I  0.3185  e  exp  23.25  (1  -  e  ) 


(156) 


(157) 


(158) 


(159) 


163.  In  a  similar  manner,  Che  reliabilicy  based  on  Che  failure  race  per 
mile  can  be  calculaCed  Co  be: 


R(c)  ■  exp  £4.577  (1  -  e^‘^3I7t)J  (160) 

Reliabilicy  curves  for  various  mission  Cimes  for  Equations  158  and  160  are 
plotted  in  Figure  18. 

164.  Determining  the  availability  of  a  water  main  is  substantially  more 
difficult  because  the  failure  rate  increases  as  pipe  age  increases.  Numerical 
integration  or  Laplace  transform  methods  may  be  used  to  compute  availability. 
However,  a  simplified  procedure  can  be  used  to  evaluate  water  main  availa¬ 
bility  if  a  constant  failure  rate  is  assumed.  For  example,  the  average  fail¬ 
ure  rate  for  the  above  cited  5-mile  pipe  link  can  be  estimated  from  Figure  19 
to  be  0.48.  Assuming  an  MTTR  of  16.460  hr  (0.69  days  or  0.0019  years),  the 
availability  can  be  calculated  as  follows: 


A(°°) 


MTTF  2.08 

MTTF  +  MTTR  =  2.08  +  0.0019 


0.999 


(161) 


An  availability  of  0.999  indicates  that,  on  average,  the  main  will  be  out  of 
service  approximately  9  hr  per  year. 
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PART  VI:  CONCLUSIONS  AND  RECOMMENDATIONS 


Conclusions 


165.  Based  on  this  study,  the  following  conclusions  can  be  drawn; 

a.  At  present  there  is  no  method  for  optimal  (minimum  cost) 
design  of  looped  distribution  systems  that  is  completely  sat¬ 
isfactory.  Existing  evaluation  methods  make  no  real  attempt 
to  explicitly  generate  and  evaluate  network  layouts  in  terms 
of  their  ultimate  importance  to  the  reliability  of  water 
service. 

b.  Previous  optimization  models  have  not  considered  multiple 
loadings  for  various  emergency  conditions  and  have  not 
attempted  to  incorporate  any  probabilistic  or  other  type  of 
reliability  measures  into  the  optimization. 

c.  The  previous  optimization  models  have  made  simplifications  in 
cost  equations  and  various  constraints  in  order  to  solve  the 
problem.  Also  because  of  the  methodologies  used,  there  are 
rather  severe  limitations  as  to  the  size  of  problem  (networks) 
that  can  be  solved. 

d.  There  is  presently  no  accepted  definition  or  measure  of  the 
reliability  of  water  distribution  systems. 

e.  Only  a  very  few  limited  studies  have  dealt  with  the  reliabil¬ 
ity  of  water  distribution  systems.  There  have  not  been  any 
procedures  or  methodologies  reported  in  the  literature  that 
could  be  used  to  evaluate  water  distribution  system  (network) 
reliability.  In  fact,  there  has  been  little  work  published  in 
the  literature  that  deals  with  component  (pump,  valve,  water 
mains,  storage,  etc.)  reliability. 

f_.  Most  of  the  published  literature  dealing  with  reliability 

analysis  that  could  be  extended  to  water  distribution  systems 
concerns  electrical  engineering  and  chemical  engineering. 

g.  The  time  to  failure  type  of  analysis  could  be  a  very  important 
tool  for  defining  water  distribution  system  reliability  and 
availability.  Both  the  repair-to-f ailure  process  and  the 
failure-to-repair  processes  should  be  used. 

h.  The  ideas  of  reliability  and  unreliability  (risk)  are  fairly 
common  terms  to  water  engineers;  however,  their  probabilistic 
meanings  are  not  so  well  known.  The  concepts  of  availability 
and  unavailability,  which  are  also  probabilistic,  are  actually 
more  important  and  should  also  be  considered  in  the  so-called 
reliability  analysis  of  water  distribution  networks. 

i.  Reliability  can  be  applied  to  the  fai lure-to-repair  process 
and  the  repair-to-f ailure  process;  however,  to  consider  the 
whole  process  (the  two  combined),  the  availability  and  unavai¬ 
lability  must  be  considered. 
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Component  reliability  analysis  methods  using  stress-strength 
analysis  can  be  used  to  determine  both  static  and  dynamic 
(time-dependent)  reliability.  These  concepts  also  have  appli¬ 
cation  to  the  reliability  analysis  of  components  of  water  dis¬ 
tribution  systems. 

k.  Simple  series-parallel  combination  systems  can  be  analyzed 
quite  simply  to  determine  system  parameters,  e.g.  reliability 
and  availability.  There  have  been  many  network  reliability 
evaluation  techniques  reported  in  the  literature.  Most  of  the 
techniques  have  only  been  tested  on  small  networks.  The  most 
promising  methods  seem  to  be  the  cut  set  and  path  enumeration 
methods  and  these  should  be  the  methods  considered  for  network 
reliability  evaluation  of  water  distribution  systems. 

l.  Another  concept  that  has  proven  to  be  very  useful  in  evaluat¬ 
ing  system  failure  is  the  fault  tree  analysis.  Fault  tree 
analysis  has  shown  to  be  very  useful  in  the  field  of  chemical 
engineering  for  the  reliability  analysis  of  various  types  of 
fluid  (chemical)  flow  processes  plants.  Of  particular  inter¬ 
est  is  the  ability  to  handle  time  delays  such  as  storage  tanks 
and  standby  redundancy.  The  fault  tree  approach  should  be 
further  investigated  for  application  to  water  distribution 
systems . 

m.  Even  though  a  great  deal  of  literature  has  been  published  on 
network  reliability  evaluation,  there  are  few  commercial  com¬ 
puter  codes  available  that  have  been  tested  and  proven. 

n.  The  computer  code  for  network  reliability  evaluation  that 
seems  most  promising  is  the  DOWNTIME  for  computing  upper  and 
lower  bounds  for  system  availability  from  reliability  block 
diagrams.  Other  available  codes  from  COADE  that  may  be  help¬ 
ful  are  the  MOCUS  or  PATH  CUT  codes  to  generate  the  minimal 
cut  sets  and  minimal  path  sets.  Then  use  the  SUPERPOCUS  code 
to  compute  system  reliability,  unavailability,  and  expected 
number  of  failures.  All  of  these  codes  are  available  for 
microcomputers  that  use  the  IBM  PC  operating  system. 

o.  Other  codes  that  may  be  of  interest  are  the  KITT-2  and  the 
K1TT-1T  codes.  Th*  KITT  codes  are  based  upon  the  kinetic  tree 
theory  for  determining  system  parameters.  The  KITT-1T  codes 
include  the  ability  to  incorporate  time  delays  provided  by 
storage  tanks  and  to  incorporate  component  (standby)  redun¬ 
dancy.  The  KITT-2  code  can  handle  increasing  failure  rates. 


Recommendations 


166.  Based  on  the  conclusions  presented  above,  the  following  recom¬ 
mendations  are  made: 

a.  Detailed  data  for  water  distribution  system  operation  should 
be  obtained.  These  data  should  then  be  used  to  preform 
detailed  studies  to  accomplish  the  following: 
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(1)  Derive  parameters,  probability  distributions,  and  failure 
'  rate  information  for  distribution  components  (pipes, 

pumps,  values,  storage,  and  other  control  devices). 

(2)  Based  upon  the  results  of  (1),  the  concepts  and  use  of 
the  following  system  parameters  should  be  investigated: 
reliability,  unreliability,  availability,  unavailability, 
system  failure  intensities,  mean  time  to  first  failure, 
expected  number  of  failures,  etc. 

(3)  Perform  a  sensitivity  analysis  of  the  various  parameters 
using  the  collected  data  and  possibly  using  generated 
data  through  a  Monte  Carlo  analysis. 

Computer  codes  such  as  DOWNTIME  and  KITT-1T  that  may  prove 
useful  for  water  distribution  system  analysis  should  be  eval¬ 
uated  using  the  data  obtained  in  a. 

Meaningful  water  distribution  system  reliability  and  availa¬ 
bility  measures  should  be  developed.  Specialized  computer 
software  for  the  reliability  and  availability  analysis  of  the 
various  system  components  should  be  developed. 
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Table  1 

Summary  of  Selected  Water  Distribution  Network  Solvers 


Hardy  Cross  Method 
Cross  (1936) 

Dillingham  (1967) 

McCormick  and  Bellamy  (1968) 

Gilman,  Goodman,  and  Metkowsi  (1971) 
DeMoyer,  Gilman,  and  Goodman  (1973) 

Newton  Raphson  Method 
Pitchai  (1966) 

Liu  (1968) 

Shamir  and  Howard  (1968) 

Epp  and  Fowler  (1970) 

Zarghamee  (1971) 

Lemieus  (1972) 

Stoner  (1972) 

Donachie  (1974) 

Combination  of  Newton  and  Hardy  Cross 
Liu  (1968) 

Graph  Theory  Method 

Kesavan  and  Chandrashekar  (1972) 

Lam  and  Wolla  (1972  a,  b) 


lacobv  (1968)  Nonlinear  programming  Pipe  diameters  Minimize  cost  of  pumps  Diameters  treated  as  Numerical  gradient  technique 
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Summary  of  Typical  Probability  Density  Functions* 


*  Reprinted  by  permission  of  Prentice-Hall,  Inc.,  Englewood  Cliffs,  N.J.,  Reliability  Engineering  and 
Risk  Assessment,  1981,  p  208. 


p 


Table  4 


Relations  Among  Probabilistic  Parameters* 


(1) 

(2) 
(3) 


o 

Ctf 


(5) 


2. 
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(6) 

(7) 
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o  3 
a  3 
a  > 

on 


(12) 

(13) 

(14) 

(15) 

(16) 

(17) 

(18) 

(19) 

(20) 


Repairable 


Non-repairable 


A(l)+Q0)- 1 
A(t)>R(t) 

Q(o<m 

*K‘)-Kt)+  f'At-u)v(u)du 
•'0 

«K0-  f'g(t~u)w(u)du 
Jo 

H'(l,l  +  dl)~w(l)dl 
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J‘\ 

Q(t)~  W(0,t)-  V(0,t) 

X(,)  i-e«) 
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WMJ-  fhMu)du 

Jh 
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*  Reprinted  by  permission  of  Prentice-Hall,  Inc.,  Englewood  Cliffs,  N.J., 
Reliability  Engineering  and  Risk  Assessment,  1981,  p  187. 


Table  5 

Reliability  and  Maintainability  by  Subsystem  Type 
(Shultz  and  Parr  1981) 


MTBF 

A 

MTTR 

Subsystem 

(x  10  hr) 

(hr) 

Pumps 

0.032066 

9.541 

Power  transmission 

0.035620 

2.273 

Motors 

0.066700 

6.854 

Valves 

0.014440 

11.615 

Controls 

0.083580 

3.696 

Table  6 

Subsystem  Reliability  and  Maintainability  by  Generic  Grou 
(Shultz  and  Parr  1981) 


Subsystem 

MTBF 

(x  106  hr) 

MTTR 

(hr) 

Pumps 

Centrifugal,  open  impeller 

0.021660 

7.825 

Axial  flow,  propeller 

0.074191 

16.780 

Power  transmission 

Concentric  reducer 

0. 122640 

2.000 

Parallel  shaft 

0.710910 

32.000 

Right  angle  shaft 

0.019480 

1.400 

Vertical  shaft 

0.031470 

2.023 

Variable  speed,  hydraulic 

0.349500 

— 

Variable  speed,  other 

0.014200 

2.500 

Gear  box 

0.045780 

3.530 

Chain  drive 

0.017850 

8.000 

Belt  drive 

0.091010 

1.800 

Motors 

Multiphase 

0.068000 

6.853 

Variable  speed,  AC 

0.  1  14820 

8.000 

Gas  engine 

0.023800 

24.000 

Valves 

Gate 

1 ' . MOSM  U\ 

3.636 

Ball 

• ' .  •)  ‘  ; 

— 

Butterfly 

i.  1 

1.000 

Plug 

i 1 .  n.'HS.’o 

— 

Controls 

Electrical 

0.  100640 

2.893 

Mechanical 

0.031230 

8.000 

Pressure  (fluid) 

0.035780 

8.236 

Pressure  (air) 

0.018600 

3.556 

2/2 
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Table  7 

Subsystem  Reliability  and  Maintainability  by  Size 
(Shultz  and  Parr  1981) 


Subsystem 

MTBF 

(*  106  hr) 

MTTR 

(hr) 

Pumps ,  gpm 

t- 10, 000 

0.039600 

6.786 

10,001-20,000 

0.031100 

7.800 

20,001-100,000 

0.081635 

26.722 

Over  100,000 

0.008366 

9.368 

Power  transmission,  hp 

0-1 

0.025370 

1.815 

2-5 

0.011010 

2.116 

6-25 

1.376400 

25.000 

26-100 

0.058620 

5.000 

101-500 

0.078380 

2.600 

Over  500 

0.206450 

32.000 

Motors,  hp 

0-1 

0.206450 

2.600 

2-5 

0.214700 

— 

6-25 

0.565600 

7.857 

26-100 

0.062100 

4.967 

101-500 

0.046000 

12.685 

Over  500 

0.064630 

7.658 

Valves,  In. 

6-12 

0.054590 

— 

13-24 

0.010810 

1.000 

25-48 

0.019070 

42.000 

Over  48 

0.007500 

2.667 

Controls,  hp 

0-1 

2.009200 

2.050 

2-5 

0.509500 

— 

6-25 

4.684900 

— 

26-100 

0.026109 

2.377 

101-500 

0.099340 

5.450 

Over  500 

0.037700 

3.125 

Table  9 

Gate  Symbols  (Henley  and  Kumamoto  1981)* 


GATE  SYMBOL 


GATE  NAME 


CAUSAL  RELATION 


1 


AND  GATE 


OUTPUT  EVENT  OCCURS  IF 
ALL  INPUT  EVENTS  OCCUR 
SIMULTANEOUSLY. 


2 


OR  GATE 


OUTPUT  EVENT  OCCURS  IF 
ANY  ONE  OF  THE  INPUT 
EVENTS  OCCURS 


INPUT  PRODUCES  OUTPUT 
WHEN  CONDITIONAL  EVENT 
OCCURS. 


PRIORITY 

AND 

GATE 


OUTPUT  EVENT  OCCURS  IF 
ALL  INPUT  EVENTS  OCCUR 
IN  THE  ORDER  FROM  LEFT 
TO  RIGHT. 


5 


EXCLUSIVE 

OR 

GATE 


OUTPUT  EVENT  OCCURS  IF 
ONE,  BUT  NOT  BOTH.  OF 
THE  INPUT  EVENTS  OCCUR. 


I 


6 


OUT  OF 
n  GATE 
IVOTING  OR 
SAMPLE  GATEI 


OUTPUT  EVENT  OCCURS  IF 
m  OUT  OF  n  INPUT  EVENTS 
OCCUR. 


Reprinted  by  permission  of  Prentice-Hall,  Inc.,  Englewood  Cliffs,  N.J 
Reliability  and  Risk  Assessment,  1981,  p  49. 


Table  10 

Event  Symbols  (Henley  and  Kumamoto  1981)* 


Reprinted  by  permission  of  Prent Ice-Hal 1 ,  Inc., 
Reliability  and  Risk  Assessment,  1981,  p  55. 
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Table  12 

Reliability,  Unreliability,  Availability,  and  Unavailability 
Data  for  A  =  0.0008/hr  and  u  =  0.02/hr 


liability 

Unreliability 

Availability 

Unavailabili 

1. 

0. 

1. 

0. 

0.9917 

0.0083 

0.9925 

0.0075 

0.9834 

0.0166 

0.9637 

0.0363 

0.9753 

0.0247 

0.9814 

0.0186 

0.9672 

0.0328 

0.9774 

0.0226 

0.9592 

0.0408 

0.9741 

0.0259 

0.9200 

0.0800 

0.9650 

0.0350 

0.8465 

0.1535 

0.9606 

0.0394 

0.7788 

0.2212 

0.9601 

0.0399 

0.7165 

0.2835 

0.9600* 

0.0400* 

0.6592 

0.3408 

0.9600 

0.0400 

0.4346 

0.5654 

0.9600 

0.0400 

0.1889 

0.8111 

0.9600 

0.0400 

0.0821 

0.9179 

0.9600 

0.0400 

0.0357 

0.9643 

0.9600 

0.0400 

0.0155 

0.9845 

0.9600 

0.0400 

0.0002 

0.9998 

•  0.9600 

0.0400 

*  Point  of  stationary  availability  and  unavailability 


I 


ft 


in  m  cn  <n 


N  n  •»<  co  n  ^ 

cm  O  O  O'  o*  O' 

N  (N  N  — *  •— <  ■— • 


O'  O'  O' 


cm  r-i  <t 


cm  «n  co  — 


O  O  O  O 


pm  *n  oo  — * 
r**  sc  *n  m 


cn  m  (N 


O 

© 

o 


mm—* 


so  n  in 


in  in  m  m  m 


-*0  0  0  0  0 


OOOOOOOOOOO 


rg^^^rsicMfntnm 


O' 

o’ 


00000 


OOOOOO 


OS  Qo  00 


— <  m  in 


— *  m  in 


— *  co  m  fn 

Os  00  co  00  00 


f*N  so 

o\  — 

-  n  n 


o 

so 

o 


OOOOOOOOOOOOOO  OOOOO 


— *  m  O'  00 

CM  O  Os  O' 

os  O'  00  CO 

Os  O'  Os  0 


OOOOOOO 


OOOOOO 


<N  O' 

o  — 

m  — 


Os  co  n- 


<y  — «  ac  m 


O  -  in  n 


OOOOOO 


o  o 
o  o 
—  o 


S’-  *  ,  » 


iZ*  r m  *i 


.v 

‘.-wJV- 

Vs’.v.N 

•  *  m 

v/\ 


-  —  ‘--4 

ft 


.»  *  ~  . 


•- 


,  *  .  *  *  >  1 

*_r* • 1 


1 

.*■  .> 

V-V.'Ii 


■»  *,  •.  *  ■  *  •* * • 
.  4,*  ■  *a  »:  V  '  .*  - 


>>.v 


Chemical  Engineering 


J.R.  Campbell  and  J.L.  Gaddy,  "Methodology  for  Simultaneous  Optimization  with 
Reliability:  Nuclear  PWR  Example,"  AlChE  Journal,  Vol.  22,  No.  6,  November 
1976,  pp.  1050-1055. 

W.-B.  Cheng  and  R.S.H.  Mah,  "Optimal  Design  of  Pressure  Relieving  Piping  Networks 
by  Discrete  Merging,"  AlChE  Journal,  Vol.  22,  No.  3,  May  1976,  pp.  471-476. 

E.J.  Henley  and  S.L.  Gandhi,  "Process  Reliability  Analysis,"  Department  of  Chemical 
Engineering,  University  of  Houston,  Houston,  Texas.  AlChE  Journal,  Vol.  21,  No. 
4,  July  1975,  pp.  677-686. 

H.  Kumamoto  and  E.J.  Henley,  "Safety  and  Reliability  Synthesis  of  Systems  with 
Control  Loops,"  AlChE  Journal,  Vol.  25,  No.  1,  January  1979,  pp.  108-113. 

B.A.  Murtagh,  "An  Approach  to  the  Optimal  Design  of  Networks,"  Computer  Sciences 
Department,  Fluor  Corp.,  Los  Angeles,  CA.  Chemical  Engineering  Science,  Vol. 
27,  1972,  pp.  1131-1141. 

G.J.  Powers,  D.L.  Cummings,  and  R.N.S.  Rathore,  "Synthesis  of  Fault  Tolerant 
Reaction  Paths,"  AlChE  Journal,  Vol.  21,  No.  1,  January  1975,  pp.  90-95. 

G.J.  Powers  and  F.C.  Tompkins,  Jr.,  "Fault  Tree  Synthesis  for  Chemical  Processes," 
AlChE  Journal,  Vol.  20,  No.  2,  March  1974,  pp.  376-387. 

J.  Weisman  and  A.G.  Holzman,  "Optimal  Process  System  Design  Under  Conditions  of 
Risk,"  Ind,  Eng.  Chem.  Process  Des.  Develop.,  VoK  1 1,  No.  3,  1972.  pp.  386-397. 

G.D.  Whitaker,  "Statistical  Reliability  Models  for  Chemical  Process  Plant,"  The 
Chemical  Engineer,  October  1973,  pp.  471-477. 


Component  Reliability 

B.S.  Dhillon,  "Mechanical  Component  Reliability  Under  Environmental  Stress,"  Micr¬ 
oelectronics  and  Reliability,  Pergamon  Press,  Vol.  20,  pp.  153-160,  1980.  Printed 
in  Canada. 

P.A.  Stancampiano,  "Monte  Carlo  Approaches  to  Stress-Strength  Interference,"  Swan¬ 
son  Engineering  Associates  Corporation,  McMurray,  Pennsylvania,  pp.  197-212. 


-  >  V*- 


:z«i 


**.  >.v] 


Fault  Trees 

R.G.  Bennetts,  "On  the  Analysis  of  Fault  Trees,"  IEEE  Transactions  on  Reliability,  Vol. 
R-24,  No.  3,  August  1975,  pp.  175-185. 

R. L.  Browning,  "Human  Factors  in  the  Fault  Tree,"  CEP,  3une  1976,  pp.  72-75. 

A.C.  Gangadharan,  M.S.M.  Rao,  and  C.  Sundararajan,  "Computer  Methods  for  Quali¬ 
tative  Fault  Tree  Analysis,"  Foster  Wheeler  Development  Corporation,  Living¬ 
ston,  New  Jersey,  pp.  251-262. 

C.L.  Innis,  Jr.,  and  T.  Hammond,  "Predicting  Mechanical  Design  Reliability  Using 
Weighted  Fault  Trees,"  pp.  213-228. 

S. L.  Salem,  G.E.  Apostolakis,  and  D.  Okrent,  "A  New  Methodology  for  the  Computer- 

Aided  Construction  of  Fault  Trees,"  Chemical,  Nuclear  and  Thermal  Engineering 
Dept.,  School  of  Engineering  and  Applied  Sciences,  University  of  California,  Los 
Angeles,  California,  90024,  U.S.A.  pp.  417-433. 

W.A.  Wolfe,  "Fault  Trees  Revisited,"  Microelectronics  and  Reliability,  Pergamon 
Press,  Vol.  17,  pp.  117-128,  1978. 


*  •  . 


*  «  «  1 


V.v. 


Hydraulic  Structural  Risk 


A. H.-S.  Ang,  "Probability  Considerations  in  Design  and  Formulation  of  Safety  Fac¬ 

tors,"  March  1971,  University  of  Illinois,  Urbana,  Illinois,  pp.  13-23. 

L.  Duckstein  and  I.  Bogardi,  "Application  of  Reliability  Theory  to  Hydraulic  Engi¬ 
neering  Design,"  Journal  of  the  Hydraulics  Division,  ASCE,  Vol.  107,  No.  HY7, 
July  1981,  pp.  799-815. 

H.-L.  Lee  and  L.W.  Mays,  "Advances  in  Risk  and  Reliability  Analysis  for  Hydraulic 
Structures,"  Presented  at  the  1983  ASCE  Hydraulics  Division  Specialty  Con¬ 
ference,  MIT,  Cambridge,  Massachusetts,  August  9-12,  1983.  7pp. 

H.-L.  Lee  and  L.W.  Mays,  "Improved  Risk  and  Reliability  Model  for  Hydraulic 
Structures,"  Water  Resources  Research,  Vol.  19,  No.  6,  December  1983,  pp. 
1415-1422. 

L.B.  Leopold,  "Probability  Analysis  Applied  to  a  Water-Supply  Problem,"  Geological 
Survey  Circular  410,  Washington,  D.C.,  1959.  18pp. 

N.C.  Lind,  "Optimal  Reliability  Analysis  by  Fast  Convolution,"  Journal  of  the 
Engineering  Mechanics  Division,  ASCE,  Vol.  105,  No.  EM3,  June  1979,  pp.  447- 
452. 

L.W.  Mays,  "Optimal  Design,  of  Culverts  Under  Uncertainties,"  Journal  of  the 
Hydraulics  Division,  ASCE,  Vol.  105,  No.  HY5,  May  1979,  pp.  443-460. 

Y.-K.  Tung  and  L.W.  Mays,  "Optimal  Risk-Based  Hydraulic  Design  of  Bridges,"  Journal 
of  the  Water  Resources  Planning  and  Management  Division,  ASCE,  Vol.  108,  No. 
WR2,  June  1982,  pp.  191-202. 

Y.-K.  Tung  and  L.W.  Mays,  "Risk  Analysis  for  Hydraulic  Design,"  Journal  of  the 
Hydraulics  Division,  ASCE,  Vol.  106,  No.  HY5,  May  1980,  pp.  893-913. 

Y.-K.  Tung  and  L.W.  Mays,  "Optimal  Risk-Based  Design  of  Hydraulic  Structures  for 
Urban  Drainage,"  Second  International  Conference  on  Urban  Storm  Drainage, 
Urbana,  Illinois,  June  14-19,  1981.  pp.  421-430. 

R.P.S.  Yadav,  "A  Reliability  Model  for  Stress  vs  Strength  Problem,"  Microelectronics 
and  Reliability,  Pergamon  Press,  Vol.  12,  1973,  pp.  119-123. 

B. C.  Yen  and  W.H.  Tang,  "Assessment  of  Data  Worth  in  Water  Resources  Projects," 

Proceedings  of  the  18th  Congress,  September  1979,  Cagliari,  Italy,  Vol.  2,  pp. 
507-514.  International  Association  for  Hydraulic  Research. 


T-^TV^.  T-, 


Mechanical  Engineering 

D.  Kececioglu,  "Probabilistic  Design  Methods  for  Reliability  and  Their  Data  and  Re¬ 
search  Requirements,"  The  University  of  Arizona,  Aerospace  &  Mechanical 
Engineering  Dept.,  Tucson,  Arizona,  pp.  285-309. 

D.B.  Parkinson,  "The  Application  of  Reliability  Methods  to  Tolerancing,"  Transactions 
of  the  ASME,  Vol.  104,  July  1982,  pp.  612-618. 

M.N.  Gopalan  and  P.  Venkateswarlu,  "Reliability  Analysis  of  Time-Dependent  Cascade 
System  With  Deterministic  Cycle  Times,"  Microelectron  Reliability,  Vol.  22,  No. 
4,  pp.  841-872.  1982  Pergamon  Press,  Printed  in  Great  Britain. 

B.  Littlewood,  "How  to  Measure  Software  Reliability  and  How  Not  To,"  IEEE 
Transactions  on  Reliability,  Vol.  R-28,  No.  2,  3une  1979,  pp.  103-110. 


A7 


Network  Reliability 


J. A.  Abraham,  "An  Improved  Algorithm  for  Network  Reliability,"  IEEE  Transactions  on 

Reliability,  Vol.  R-28,  No.  1,  April  1979,  pp.  58-61. 

K. K.  Aggarwal,  Y.C.  Chopra,  and  J.S.  Bajwa,  "Reliability  Evaluation  by  Network 

Decomposition,"  IEEE  Transactions  on  Reliability,  Vol.  R-31,  No.  4,  October 
1982,  pp.  355-358. 

K.K.  Aggarwal,  J.S.  Gupta,  and  K.B.  Misra,  "A  New  Method  for  System  Reliability 
Evaluation,"  Microelectronics  and  Reliability,  Vol.  12,  pp.  435-440.  Pergamon 
Press  1973.  Printed  in  Great  Britain. 

K. K.  Aggarwal,  K.B.  Misra,  and  J.S.  Gupta,  "A  Fast  Algorithm  for  Reliability 

Evaluation,"  IEEE  Transactions  on  Reliability,  Vol.  R-24,  No.  1,  April  1975,  pp. 
83-85. 

H.  Ahmad,  "A  Simple  Technique  for  Computing  Network  Reliability,"  IEEE  Tran¬ 
sactions  on  Reliability,  Vol.  R-31,  No.  1,  April  1982,  pp.  41-44. 

M.A.  Beg,  "Estimation  of  Pr(Y  <  X)  for  Exponential-Family,"  IEEE  Transactions  of 
Reliability,  Vol.  R-29,  No.  2,  June  1980,  pp.  158-159. 

R.G.  Bennetts,  "Analysis  of  Reliability  Block  Diagrams  by  Boolean  Techniques,"  IEEE 
Transactions  on  Reliability,  Vol.  R-31,  No.  2,  June  1982,  pp.  159-166. 

H.S.  Blanks,  "A  Review  of  New  Methods  and  Attitudes  in  Reliability  Engineering," 
Microelectronics  and  Reliability,  Pergamon  Press,  1973.  Vol.  12,  pp.  301-319. 

R.C.  Bollinger,  "Direct  Computation  for  Consecutive-k-out-of-n:  F  Systems,"  IEEE 
Transactions  on  Reliability,  Vol.  R-31,  No.  5,  December  1982.  pp.  444-445. 

D.B.  Brown,  "A  Computerized  Algorithm  for  Determining  the  Reliability  of  Redundant 
Configurations,"  IEEE  Transactions  on  Reliability,  Vol.  R-20,  No.  3,  August  1971. 
pp.  121-124. 

T.  Case,  "A  Reduction  Technique  for  Obtaining  a  Simplified  Reliability  Expression," 
IEEE  Transactions  on  Reliability,  Vol.  R-26,  No.  4,  October  1977,  pp.  248-249. 

D.K.  Chow,  "Reliability  of  Two  Items  in  Sequence  with  Sensing  and  Switching,"  IEEE 
Transactions  on  Reliability,  Vol.  R-20,  No.  4,  November  1971,  pp.  254-256. 

B.L.  Deuermeyer,  "A  New  Approach  for  Network  Reliability  Analysis,"  IEEE  Tran¬ 
sactions  on  Reliability,  Vol.  R-31,  No.  4,  October  1982,  pp.  350-354. 

L. C.  Frair,  P.M.  Ghare,  and  K.L.  Frair,  "Optimization  of  System  Reliability  via  Re¬ 

dundancy  and/or  Design  Considerations,"  IEEE  Transactions  on  Reliability,  Vol. 
R-29,  No.  1,  April  1980.  pp.  33-37.  '  "  ~  '  . . 

L.  Fratta  and  U.G.  Montanari,  "A  Boolean  Algebra  Method  for  Computing  the  Terminal 
Reliability  in  a  Communication  Network,"  IEEE  Transactions  on  Circuit  Theory, 
Vol.  CT-20,  No.  3,  May  1973,  pp.  203-21  1. 


J.N.  Hagstrom,  "Using  the  Decomposition-Tree  of  a  Network  in  Reliability  Compu¬ 
tation,"  IEEE  Transactions  on  Reliability,  Vol.  R-32,  No.  1,  April  1983,  pp.  71-78. 

E.  Hansler,  G.K.  McAuliffe,  and  R.S.  Wilkov,  "Exact  Calculation  of  Computer  Network 

Reliability,"  Networks,  4:95-112,  copyright  1974  by  3ohn  Wiley  &  Soncs,  Inc. 

3.C.  Hudson  and  K.C.  Kapur,  "Reliability  Theory  for  Multistate  Systems  with  Multi¬ 
state  Components,"  Microelectronics  Reliability,  Vol.  22,  No.  1,  pp.  1-7,  1982, 
Printed  in  Great  Britain,  Pergamon  Press  Ltd. 

F. K.  Hwang,  "Fast  Solutions  for  Consecutive-k-out-of-n:  F  System,"  IEEE  Tr¬ 

ansactions  on  Reliability,  Vol.  R-31,  No.  5,  December  1982.  pp.  447-448. 

C.L.  Hwang,  F.A.  Tillman,  and  M.H.  Lee,  "System-Reliability  Evaluation  Techniques 
for  Complex/Large  Systems — A  Review,"  Kansas  State  University,  Manhattan. 
IEEE  Transactions  on  Reliability,  Vol.  R-30,  No.  5,  pp.  416-423,  December  1981. 

3.M.  Kontoleon,  "An  Algorithm  for  Evaluating  Overall  Reliability  of  Special  Tree- 
Networks,"  IEEE  Transactions  of  Reliability,  Vol.  R-29,  No.  4,  October  1979.  pp. 
314. 

Kontoleon,  "Reliability  Improvement  of  Multiterminal  Networks,"  IEEE  Transactions 
on  Reliability,  Vol.  R-29,  No.  1,  April  1980,  pp.  75-76. 

S.  Kumar,  "Some  Reliability  Routing  Problems  in  an  Acyclic  Directed  Network  with 
Stochastic  Terminal  Node,"  Microelectron  Reliability,  Vol.  22,  No.  1.  pp.  31-41, 
1982  Pergamon  Press,  Printed  in  Great  Britain. 

5.H.  Lee,  "Reliability  Evaluation  of  a  Flow  Network,"  IEEE  Transactions  on  Rel¬ 
iability,  Vol.  R-29,  No.  1,  April  1980,  pp.  24-26. 

K.B.  Misra,  "An  Algorithm  for  the  Reliability  Evaluation  of  Redundant  Networks," 
IEEE  Transactions  on  Reliability,  Vol.  R-19,  No.  4,  November  1970,  pp.  146-151. 

K.B.  Misra,  "A  Method  of  Redundancy  Allocation,"  Microelectronics  and  Reliability, 
Pergamon  Press.  Vol.  12,  pp.  389-393. 

K.B.  Misra,  "Reliability  Design  of  A  Maintained  System,"  Microelectronics  and 
Reliability,  Vol.  13,  No.  6,  pp.  495-500.  Pergamon  Press,  1974. 

K.B.  Misra  and  C.E.  Carter,  "Redundancy  Allocations  In  A  System  With  Many  Stages," 
Microelectronics  and  Reliability,  Pergamon  Press  1973.  Vol.  12,  pp.  223-228. 

K.B.  Misra  and  P.  Prasad,  Comment  on  "Reliability  Evaluation  of  a  Flow  Network," 
IEEE  Transactions  on  Reliability,  Vol.  R-31,  No.  2,  3une  1982,  pp.  174-176. 

K.B.  Misra  and  T.S.M.  Rao,  "Reliability  Analysis  of  Redundant  Networks  Using  Flow 
Graphs,"  IEEE  Transactions  on  Reliability,  Vo!.  R-19,  No.  1,  February  1970.  pp. 
19-24. 

K.D.  Narasimhalu  and  M.  Sivaramakrishnan,  "A  Rapid  Algorithm  for  Reliability 
Optimization  of  Parallel  Redundant  Systems,"  IEEE  Transactions  on  Reliability, 
Vol.  R-27,  No.  4,  October  1978,  pp.  261-263. 


A  9 


Y.B.  Ness  and  H.  Livni,  "Reliability  Optimization  in  the  Design  of  Telephone 
Networks,"  IEEE  Transactions  on  Reliability,  Vol.  R-27,  No.  5,  December  1978, 
pp.  315-328. 

S.  Rai  and  K.K.  Aggarwal,  "Path  Enumeration  Using  Flow  Graphs,"  Microelectronics 
Reliability,  Vol.  19,  pp.  391-393.  Pergamon  Press,  1979. 

A.  Satyanarayana  and  A.  Prabhakar,  "New  Topological  Formula  and  Rapid  Algorithm 
for  Reliability  Analysis  of  Complex  Networks,"  IEEE  Transactions  on  Reliability, 
Vol.  R-27,  No.  2,  June  1978,  pp.  82-100. 

W.G.  Schneeweiss  and  K.D.  Heidtmann,  "Synthesis  and  Number  of  all  Series-Parallel 
Structures  with  n  Components,"  IEEE  Transactions  of  Reliability,  Vol.  R-29,  No. 

4,  October  1980.  pp.  315-319. 

3.G.  Shanthikumar,  "Recursive  Algorithm  to  Evaluate  the  Reliability  of  a  Consecutive- 
k-out-of-n:  F  System."  IEEE  Transactions  on  Reliability,  Vol.  R-31,  No.  5, 
December  1982.  pp.  442-443. 

Y.S.  Sherif,  "Optimal  Maintenance  Schedules  of  Systems  Subject  to  Stochastic 
Failure,"  Microelectron  Reliability,  Vol.  22,  No.  1,  pp.  15-29.  1982  Pergamon 

Press,  Printed  in  Great  Britain. 

Y.S.  Sherif,  "Reliability  Analysis:  Optimal  Inspection  and  Maintenance  Schedules  of 
Failing  Systems,"  Microelectron  Reliability,  Vol.  22,  No.  1,  pp.  59-115.  1982 

Pergamon  Press,  Printed  in  Great  Britain. 

C.  Singh,  "A  Matrix  Approach  to  Calculate  the  Failure  Frequency  and  Related 
Indices,"  Microelectronics  Reliability,  Vol.  19,  pp.  395-398,  Pergamon  Press  Ltd. 
1979,  Printed  in  Great  Britain. 

N.  Singh,  "Distribution  of  Time  to  Non-Availability  of  a  Reliability  System,"  ^ 
Microelectronics  and  Reliability,  Pergamon  Press  1973.  Vol.  12,  pp.  337-342. 

P.R.  Tadikamalla,  "Age  Replacement  Policies  for  Weibul!  Failure  Times,"  IEEE 
Transactions  on  Reliability,  Vol.  R-29,  No.  1,  April  1980,  pp.  88-90. 

F.A.  Tillman,  C.-L.  Hwang,  L.-T.  Fan,  and  K.C.  Lai,  "Optimal  Reliability  of  a  Complex 
System,"  IEEE  Transactions  on  Reliability,  Vol.  R-29,  No.  3,  August  1970,  pp. 
95-100. 

J.  Torrey,  "A  Pruned  Tree  Approach  to  Reliability  Computation,"  IEEE  Transactions  on 
Reliability,  Voi.  R-32,  No.  2,  June  1983,  pp.  170-174. 

A.  Winterbottorn  and  J.L.  Verrall,  "Confidence  Limits  for  System  Reliability:  A  Se¬ 
quential  Method,"  IEEE  Transactions  on  Reliability,  Vol.  R-20,  No.  4,  November 
1971.  pp.  204-21 !. 


B.P.  Zelentsov,  "Reliability  Analysis  of  Large  Nonrepairable  Systems,"  IEEE  Trans¬ 
actions  on  Reliability,  Vol.  R-19,  No.  4,  November  1970.  pp.  1  52-1  56. 


Nuclear  Engineering 

3.3.  Burns,  3r.,  "Reliability  of  Nuclear  Power  Plant  Steam  Turbine  Overspeed  Control 
System,"  U.S.  Nuclear  Regulatory  Commission,  pp.  27-38. 

3.  de  S.  Coutinho,  "Reliability  and  Maintainability  Research  Needs  In  Design  Control 
and  Risk  Management,"  U.S.  Army  Material  Systems  Analysis  Activity,  Aberdeen 
Proving  Ground,  Aberdeen,  Maryland,  pp.  269-283. 

K.C.  Hayden,  "Common-Mode  Failure  Mechanisms  in  Redundant  Systems  Important  to 
Reactor  Safety,"  Control  and  Instrumentation,  ed.  by  E.W.  Hagen,  Nuclear 
Safety,  Vol.  17,  No.  6,  November-December  1976,  pp.  686-693. 

A.  Laviron,  A.  Carnino,  and  3.C.  Manarache,  "ESCAF  -  A  New  and  Cheap  System  for 
Complex  Reliability  Analysis  and  Computation,"  French  Atomic  Energy  Com¬ 
mission  (CEA),  Valduc  and  Paris.  IEEE  Transactions  on  Reliability,  Vol.  R-31, 
No.  4,  pp.  339-349,  October  1982. 

G.W.  Parry  and  P.W.  Winter,  "Characterization  and  Evaluation  of  Uncertainty  in 
Probabilistic  Risk  Analysis,"  Accident  Analysis,  ed.  by  P.M.  Haas,  Nuclear 
Safety,  Vol.  22,  No.  1,  3anuary-February  1981,  pp.  28-42. 

l.B.  Wall,  "Probabilistic  Assessment  of  Flooding  Hazard  for  Nuclear  Power  Plants," 
Accident  Analysis,  Nuclear  Safety,  Vol.  15,  No.  4,  3uly-August  1974,  pp. 
399-408. 

l.B.  Wall,  "Probabilistic  Assessment  of  Aircraft  Risk  for  Nuclear  Power  Plants,"  ed.  by 
3.R.  Engel,  Nuclear  Safety,  Vol.  15,  No.  3,  May-3une  1974,  pp.  276-284. 


19 74.  Printed  in  Great  Britain. 


D.K.  Chow,  "Reliability  of  Two  Items  in  Sequence  with  Sensing  and  Switching,"  IEEE 
Transactions  on  Reliability,  Vol.  R-20,  No.  4,  November  1971,  pp.  254-256. 

R. C.  Garg  and  C.M.  Sharma,  "Stand-By  Redundancy  Complex  System's  Reliability," 

Microelectronics  and  Reliability,  Pergamon  Press  1973,  Vol.  12,  pp.  321-328. 
Printed  in  Great  Britain. 

K.  Gopal,  K.K.  Aggarwal,  and  J.S.  Gupta,  "An  Improved  Algorithm  for  Reliability 
Optimization,"  IEEE  Transactions  on  Reliability,  Vol.  R-27,  No.  5,  December 
1978,  pp.  325-328. 

K.  Gopal,  K.K.  Aggarwal,  and  J.S.  Gupta,  "A  New  Approach  to  Optimal  Redundancy 
Allocation  for  Complex  Networks,"  Microelectronics  Reliability,  Vol.  19,  pp. 
387-390.  Pergamon  Press,  Ltd.,  1979.  Printed  in  Great  Britain. 

3.Y.  Kim  and  L.C.  Frair,  "Optimal  Reliability  Design  By  a  Transformation  Technique," 
IEEE  Transactions  on  Reliability,  Vol.  R-32,  No.  1,  April  1983,  pp.  102-105. 

W.  Kuo,  C.-L.  Hwang,  and  F.A.  Tillman,  "A  Note  on  Heuristic  Methods  in  Optimal 
System  Reliability,"  IEEE  Trasactions  on  Reliability,  Vol.  R-27,  No.  5,  December 
1978,  pp.  320-324. 

K.B.  Misra,  "A  Fast  Method  for  Redundancy  Allocation,"  Microelectronics  and 
Reliability,  Pergamon  Press  1973,  Vol.  12,  pp.  385-387.  Printed  in  Great  Britain. 

K.B.  Misra  and  3.  Sharma,  "Reliability  Optimization  of  a  System  by  Zero-One 
Programming,"  Microelectronics  and  Reliability,  Pergamon  Press  1973,  Vol.  12, 
pp.  229-233.  Printed  in  Great  Britain. 

Narasimhalu  and  Sivaramakrishnan,  "A  Rapid  Algorithm  lor  Reliability  Optimisation  of 
Parallel  Redundant  Systems,"  IEEE  Transactions  on  Reliability,  Vol.  R-27,  No.  4, 
October  1978,  pp.  261-263. 

D.S.  Necsulescu  and  M.  Krieger,  "Reliability  Optimization — A  Case  Study,"  IEEE 
Transactions  on  Reliability,  Vol.  R-31,  No.  1,  April  1982,  pp.  101-104. 

Y.B.  Ness  and  H.  Livni,  "Reliability  Optimization  in  the  Design  of  Telephone 
Networks,"  IEEE  Transactions  on  Reliability,  Vol.  R-27,  No.  5,  December  1978, 
pp.  315-319. 

S.  Prakash,  "Analytic  Study  of  a  Stand-By  Redundant  Equipment  with  Switching  and 

Shelf  Life  Failures,"  Microelectronics  and  Reliability,  Pergamon  Press  1973, 
Vol.  12,  pp.  329-335.  Printed  in  Great  Britain. 

R.  Sasaki,  T.  Okada,  and  S.  Shingai,  "A  New  Technique  to  Optimize  System  Reli¬ 
ability,"  IEEE  Transactions  on  Reliability,  Vol.  R-32,  No.  2,  June  1983,  pp.  175- 


P.  Sen,  "Optimization  of  a  Reliability  Growth  Problem,"  Microelectronics  and  R- 
eliability,  Vol.  13,  pp.  87-89,  Pergamon  Press  1974.  Printed  in  Great  Britain. 

F.A.  Tillman,  C.-L.  Hwang,  and  W.  Kuo,  "Determining  Component  Reliability  and 
Redundancy  for  Optimum  System  Reliability,"  IEEE  Transactions  on  Reliability, 
Vol.  R-26,  No.  3,  August  1977,  pp.  162-165. 

F.A.  Tillman,  C.-L.  Hwang,  and  W.  Kuo,  "Optimization  Techniques  for  System 
Reliability  with  Redundancy — A  Review,"  IEEE  Transactions  on  Reliability,  Vol. 
R-26,  No.  3,  August  1977,  pp.  148-155. 


F.A.  Tillman,  C.-L.  Hwang,  L.-T.  Fan,  and  K.C.  Lai,  "Optimal  Reliability  of  a  Complex 
System,"  IEEE  Transactions  on  Reliability,  Vol.  R-19,  No.  3,  August  1970,  pp. 
95-100. 


Papers  from  1972  Reliability  and  Maintainability  Symposium 

R.A.  Brumback  and  R.A.  Kowalski,  "The  Reliability  of  Multiplexed  Systems,"  West- 
inghouse  Electric  Corporation,  Baltimore,  Maryland  21203,  pp.  68-77. 

J.B.  deMercado,  "Reliability  Prediction  Studies  of  Degradation  -  Prone  Systems," 
Canadian  Government,  Department  of  Communications,  Canada,  pp.  200-204. 

J.C.  Dickson,  J.L.  Hesse,  A.C.  Kientz,  and  M.L.  Shooman,  "Quantitative  Analysis  of 
Software  Reliability,"  SAFEGUARD  System  Evaluation  Agency,  White  Sands 
Missile  Range,  N.M.,  Polytechnic  Institute  of  Brooklyn,  Brooklyn,  N.Y.,  pp. 
148-157. 

M.J.  Flynn  and  S.A.  Szygenda,  "A  Perspective  on  Computer  Reliability,"  Computer 
Science  Department,  Johns  Hopkins  University,  Baltimore  Maryland  21218, 
Computer  Sciences/Operations  Research  Center,  Southern  Methodist  University, 
Dallas,  Texas  75222,  pp.  166-173. 

H.C.  Jones,  "Reliability  Modeling,"  Proceedings,  1972  Annual  Reliability  and  Main¬ 
tainability  Symposium,  Jan.  25-27,  1972.  pp.  24-29. 

P.R.  Leclercq  and  C.M.  Marcovici,  "Complex  Mission  Worth  Optimization  by  Redun¬ 
dancies,"  S.A.  Engins  MATRA,  78  -  Velizy  -  France,  pp.  354-360. 

E.R.  Levitt,  "Application  of  Decision-Making  Models,"  Westinghouse  Defense  and 
Space  Center,  Baltimore,  Maryland  21203.  pp.  446-457. 

G.G.  Naegele  and  D.W.  Sellinschegg,  "Test  Cost  Minimization  for  Reliability  Assess¬ 
ment,"  Institut  fur  Angewandte  Reaktorphysik  Kernforschungszentrum  Karls¬ 
ruhe,  Federal  Republic  of  Germany,  pp.  300-318. 

D.J.  Simkins,  "Weibull,  Bayes  and  Entropy,"  IBM  Electronics  Systems  Center,  Owego, 
New  York  13827.  pp.  190-199. 

J.T.  Stracener,  "Truncated  Sequential  Life  Tests:  Conditional  Probabilities  and  Test 
Time,"  Vought  Electronics  Company,  A  Division  of  LTV  Aerospace  Corporation, 
Dallas,  Texas,  pp.  376-385. 

W.T.  Sumerlin,  "Confidence  Calculations  for  MIL-STD-78 1,"  McDonnell  Aircraft 
Company,  McDonnell  Douglas  Corporation,  St.  Louis,  Missouri,  pp.  205-212. 

C.P.  Tsokos,  "A  Bayesian  Approach  to  Reliability:  Theory  and  Simulation,"  Depart¬ 
ment  of  Statistics  and  Statistical  Laboratory,  Virginia  Polytechnic  Institute  and 
State  University,  Blackburg,  Virginia,  pp.  78-87. 

Myron  A.  Wilson,  "Planning  Optimum  Maintainability  Demonstrations,"  General  Elec¬ 
tric  Company,  Philadelphia,  Pennsylvania,  pp.  137-147. 


Pipe  Network  Analysis 


A.G.  Collins  and  R.L.  Johnson,  "Finite-Element  Method  for  Water-Distribution  Net¬ 
works,"  Journal_JAW\VA,  July  1975,  pp.  385-389. 

J.H.  Dillingham,  "Computer  Analysis  of  Water  Distribution  Systems,"  Water  &  Sewage 
Works,  February  1967,  pp.  43-45. 

R.P.  Donachie,  "Digital  Program  for  Water  Network  Analysis,"  Journal  of  the 
Hydraulics  Division,  ASCE,  Vol.  100,  No.  HY3,  March  1974,  pp.  393-402. 

R.  Epp  and  A.G.  Fowler,"  Journal  of  the  Hydraulics  Division,  ASCE,  Vol.  96,  No.  HY1, 
January  1970,  pp.  43-56. 

H.K.  Kesavan  and  M.  Chandrashekar,"  Journal  of  the  Hydraulics  Division,  ASCE,  Vol. 
98,  No.  HY2,  February  1972,  pp.  345-364. 

P.F.  Lemieux,  "Efficient  Algorithm  for  Distribution  Networks,"  Journal  of  the 
Hydraulics  Division,  ASCE,  Vol.  98,  No.  HY 1 1,  November  1972,  pp.  191 1-1920. 

R.  deNeufville,  J.  Schaake,  Jr.,  and  J.H.  Stafford,  "Systems  Analysis  of  Water 
Distribution  Networks,"  Journal  of  the  Sanitary  Engineering  Division,  ASCE,  Vol. 
97,  No.  SA6,  December  1971,  pp.  825-842. 

N.S.  Zarghamee,  "Mathematical  Model  for  Water  Distribution  Systems,"  Journal  of  the 
Hydraulics  Division,  ASCE,  Vol.  97,  No.  HY1,  January  1971,  pp.  1-14. 


Power  Supply  Reliability 

R.N  Allan,  M.F.  DeOliveira,  J.A.  Chambers,  and  R.  Billinton,  "Reliability  Effects  of 
the  Electrical  Auxiliary  Systems  in  Power  Stations,"  UM1ST,  Manchester  U.K.; 
University  of  Porto,  Portugal;  C.E.G.B.,  Barnwood  U.K.  and  University  of 
Saskatchewan,  Canada,  pp.  28-31. 

R.N.  Allan  and  M.F.  DeOliveira,  "Reliability  Analysis  in  the  Design  of  Transmission 
and  Distribution  Systems,"  UM1ST,  Manchester,  U.K.  and  University  of  Porto, 
Portugal,  pp.  58-59. 

T.O.  Berntsen  and  Arne  T.  Holen,  "Security  Control  in  Network  System  Operation," 
The  Norwegian  Water  Resources  and  Electricity  Board,  Norwegian  Institute  of 
Technology,  pp.  154-157. 

R.  Billinton  and  C.  Singh,  "Generating  Capacity  Reliability  Evaluation  in  Intercon¬ 
nected  Systems  Using  a  Frequency  and  Duration  Approach;  Part  I.  Mathematical 
Analysis,"  Power  System  Research  Group,  University  of  Saskatchewan,  Presented 
at  the  IEEE  Winter  Power  Meeting,  New  York,  N.Y.,  Jan.  31-Feb.  5,  1971.  pp. 
1646-1654. 

R.  Billinton  and  C.  Singh,  "Generating  Capacity  Reliability  Evaluation  in  Intercon¬ 
nected  Systems  Using  a  Frequency  and  Duration  Approach;  Part  II.  System 
Applications,"  Power  System  Research  Group,  University  of  Saskatchewan,  Pre¬ 
sented  at  the  IEEE  Winter  Power  Meeting,  New  York,  N.Y.,  Jan.  31-Feb.  5,  1971. 
pp. 1654-1671. 

R.  Billinton  and  C.  Singh,  "Generating  Reliability  Evaluation  in  Interconnected  Sys¬ 
tems  Using  a  Frequency  and  Duration  Approach;  Part  III.  Correlated  Load 
Models,"  Power  System  Research  Group,  University  of  Saskatchewan,  Presented 
at  the  IEEE  Winter  Power  Meeting,  New  York,  N.Y.,  Jan.  30-Feb.  4,  1972.  pp. 
2143-2153. 

R.  Billinton  and  C.  Singh,  "System  Load  Representation  in  Generating  Capacity  Reli¬ 
ability  Studies;  Part  I.  Model  Formulation  and  Analysis,"  Power  System 
Research  Group,  University  of  Saskatchewan,  Presented  at  the  IEEE  Winter 
Power  Meeting,  New  York,  N.Y.,  Jan.  30-Feb.  4,  1972.  pp.  2125-2132. 

R.  Billinton  and  C.  Singh,  "System  Load  Representation  in  Generating  Capacity  Reli¬ 
ability  Studdies;  Part  II.  Applications  and  Extensions,"  Power  System  Research 
Group,  University  of  Saskatchewan,  Presented  at  the  IEEE  Winter  Power 
Meeting,  New  York,  N.Y.,  Jan.  30-Feb.  4,  1972.  pp.  2133-2142. 

H.  Bocker  and  W.  Kaufman,  "Optimizing  the  Expenditure  on  Reliability  of  Supply  in 
System  Planning  for  the  Benefit  of  the  Consumer,"  Universitat  Stuttgart, 
Technische  Werke  der  Stadt  Stuttgart  AG  (TWS),  pp.  18-23. 

H.  Burgess,  "The  Benefits  of  Quantitative  Analysis  in  the  Assessment  of  Electrical 
System  Reliability,"  National  Centre  of  Systems  Reliability,  UKAEA,  Culcheth, 
Warrington,  pp.  7-12. 

W.  Cartwright  and  B.A.  Coxson,  "Quantifying  the  Impact  of  Manual  Emergency 
Transfer  Switching  Upon  the  Reliability  of  Distribution  Systems,"  The  Electricity 
Council,  U.K.,  pp.  85-88. 


*  *vv  v**.'V  -'T- *r  rr  r?  rr  *r *? 


•■.  -•  ■■  ■  ,,7?V 


A.K.  David,  "Reliability  Criteria  for  Power  Systems  with  Inherent  Generation  and 
Transmission  Deficiencies,"  IEEE  Transactions  on  Reliability,  Vol.  R-30,  No.  1, 
April  1981.  pp.  18-21. 

W.  Edwin,  H.-D.  Kochs,  and  G.  Traeder,  "Possibilities  of  Reliability  Calculations  for 
the  Determination  of  Necessary  Fast  Reserve  of  Power  Systems  in  the  European- 
Interconnected-System,"  Institut  fur  Elektrische  Anlagen  und  Energiewirtschaft, 
RWTH  Aachen,  Federal  Republic  of  Germany,  pp.  137-140. 

H.  Khatib,  "A  Generating  System  Planning,  Performance,  and  Reliability  Assessment," 
Jordan  Electricity  Authority,  pp.  150-153. 

Krasnodebski  and  T.J.  Ravishankar,  "Reliable  Electric  Power  Generation  -  A  Utility's 
View  of  Supplier's  Role,"  Design  and  Development  Division,  Ontario  Hydro,  pp. 
35-37. 


B.  DeMartino,  G.  Fusco,  E.  Mariani,  and  L.  Vergelli,  "Considerations  Concerning  the 

Choice  of  Optimal  Risk  in  Medium  Term  Generation  Scheduling,"  ENEL,  National 
Control  Center,  Rome,  pp.  24-27. 

A.D.  Patton  and  A.  Stasinos,  "Variance  and  Approximate  Confidence  Limits  on  LOLP 
for  a  Single-Area  System,"  Electric  Power  Institute,  Texas  A&M  University, 
College  Station,  Texas.  IEEE  Transactions  on  Power  Apparatus  and  Systems, 
Vol.  PAS-94,  No.  4,  pp.  1326-1336.  July/August  1975. 

A.D.  Patton  and  A.  Stasinos,  "Practical  Methods  in  the  Computation  of  LOLP  Vari¬ 
ances  and  Confidence  Limits,"  Electric  Power  Institute,  Texas  A6cM  University, 
College  Station,  Texas.  IEEE  Transactions  on  Power  Apparatus  and  Systems, 
Vol.  PAS-95,  No.  1,  pp.  287-290.  January/February  1976. 

A.D.  Patton  and  N.H.  Tram,  "Parameter  Uncertainty  in  Generation  Reliability 
Studies,"  Electric  Power  Institute,  Texas  A&M  University,  College  Station, 
Texas,  USA,  pp.  123-126. 

A.H.  Peyrot,  M.E.  Criswell,  M.D.  Folse,  and  J.-P.  Aznavour,  "Reliability  Analysis  of 
Wood  Transmission  Poles,"  Journal  of  the  Structural  Division,  ASCE,  Vol.  108, 
No.  ST9,  September  1982,  pp.  1981-1994. 

Z.  Reguly,  "Costs  Evaluation  of  Power  Supply  Reliability,"  Hungarian  Electricity 
Board,  Politechnical  University  of  Budapest,  pp.  13-17. 

C.  Singh,  "Reliability  Modeling  Algorithms  for  a  Class  of  Large  Repairable  Systems," 

Transit  Systems  Research  &  Development  Branch,  Ontario  Ministry  of  Trans¬ 
portation  and  Communications.  Microelectronics  and  Reliability,  Vol.  15,  pp. 
159-162.  Pergamon  Press,  1076. 

C.  Singh  and  R.  Billinton,  "A  Frequency  and  Duration  Approach  to  Short  Term  Reli¬ 
ability  Evaluation,"  Power  System  Research  Group,  University  of  Saskatchewan. 
Presented  at  the  IEEE  PES  Winter  Meeting,  New  York,  N.Y.,  Jan.  2S-Feb.  2, 
1973.  pp.  2973-2083. 


A  .  , 


G.R.  Snaith,  "Reliability  Evaluation  of  an  Electrical  Supply  System  for  a  Nuclear 
Power  Station,"  National  Centre  of  Systems  Reliability,  Safety  and  Reliability 
Directorate,  UKAEA,  Culcheth,  Warrington,  pp.1-6. 

J.  Sozanski,  "Economic  Aspects  of  Reliability  in  Electric  Power  Supply,"  Technical 
University  of  Kielce,  Poland,  pp.  32-34. 

P.R.  Van  Horne  and  C.N.  Schoenberger,  "TRAP:  An  Innovative  Approach  to  Analyzing 
the  Reliability  of  Transmission  Plans,"  IEEE  Transactions  on  Power  Apparatus 
and  Systems,  Vol.  PAS-101,  No.  1,  January  1982,  pp.  11-16. 


Reliability  of  Real-Time  Systems 

C.  Singh,  "On  s-lndependence  in  'A  New  Method  to  Determine  the  Failure  Frequency  of 
A  Complex  System',"  IEEE  Transactions  on  Reliability,  Vol.  R-27,  No.  2,  June 
1978,  pp.  147-148. 

C.  Singh,  "On  the  Behaviour  of  Failure  Frequency  Bounds,"  IEEE  Transactions  on 
Reliability,  Vol.  R-26,  No.  1,  April  1977,  pp.  63-66. 

E.  Yourdon,  "Reliability  of  Real-Time  Systems,  Part  Is  Different  Concepts  of 

Reliability,"  Modern  Data,  January  1972,  pp.  36-42. 

E.  Yourdon,  "Reliability  of  Real-Time  Systems,  Part  2:  The  Causes  of  System 
Failures,"  Modern  Data,  February  1972,  pp.  50-56. 

E.  Yourdon,  "Reliability  of  Real-Time  Systems,  Part  3:  The  Causes  of  System  Failures 
(continued),"  Modern  Data,  March  1972,  pp.  36-41. 

E.  Yourdon,  "Reliability  of  Real-Time  Systems,  Part  4:  Examples  of  Real-Time 
System  Failures,"  Modern  Data,  April  1972,  pp.  52-57. 

E.  Yourdon,  "Reliability  of  Real-Time  Systems,  Part  5:  Approaches  to  Error 
Recovery,"  Modern  Data,  May  1972,  pp.  38-52. 


Repairable  Systems 


J.A.  Buzacott,  "Network  Approaches  to  Finding  the  Reliability  of  Repairable  Sys¬ 
tems,"  IEEE  Transactions  on  Reliability,  Vol.  R-19,  No.  4,  November  1970,  pp. 
140-146. 

H.  Kumamoto,  K.  Tanaka,  K.  Inoue,  and  E.3.  Henley,  "State-Transition  Monte  Carlo 
for  Evaluating  Large,  Repairable  Systems,"  IEEE  Transactions  on  Reliability, 
Vol.  R-29,  No.  5,  December  1980,  pp.  376-380. 

Y.S.  Sherif,  "Expected  Repair  Cost  Under  Doubly  Stochastic  Damage  Processes," 
Microelectronics  Reliability,  Vol.  22,  No.  1,  pp.  9-13,  1982,  Printed  in  Great 
Britain,  Pergamon  Press  Ltd. 

C.  Singh,  "Reliability  Analysis  of  Large  Repairable  Systems,"  Microelectronics  and 
Reliability,  Vol.  13,  pp.  487-493.  Pergamon  Press,  1974,  Printed  in  Great 
Britain. 

B.P.  Zelentsov,  "Reliability  Analysis  of  Large  Nonrepairable  Systems,"  IEEE  Tran¬ 
sactions  on  Reliability,  Vol.  R-19,  No.  4,  November  1970,  pp.  152-156. 


Time  to  Failure 


B. S.  Dhillon,  "Network  Reliability  Evaluation:  Application  of  Bathtub  Failure  Rate 

Curve,"  Microelectronic  Reliability,  Pergamon  Press  Ltd.,  Vol.  21,  No.  1,  1981, 
pp.  103-111. 

K.C.  Kapur  and  L.R.  Lamberson,  "Optimum  Test  Design  Strategies,"  Microelectronics 
and  Reliability,  Pergamon  Press,  Vol.  20,  pp.  75-81,  1980. 

C.  Singh,  "Reliability  Modeling  of  TMR  Computer  Systems  With  Repair  and  Common 

Mode  Failures,"  Microelectron  Reliability,  Pergamon  Press,  Vol.  21,  No.  2,  pp. 
259-262,  1981. 


Water  Distribution  Optimization 


E.  Alperovits  and  U.  Shamir,  "Design  of  Optimal  Water  Distribution  Systems,"  Water 
Resources  Research,  Vol.  13,  No.  6,  December  1977,  pp.  885-900. 

H. D.  Altinbilek,  "Optimum  Design  of  Branches  Water  Distribution  Networks  by  Linear 

Programming,"  1981  International  Symposium  on  Urban  Hydrology,  Hydraulics, 
and  Sediment  Control,  University  of  Kentucky,  Lexington,  Kentucky,  July  27-30, 
1981,  pp.  249-254. 

P.R.  Bhave,  "Noncomputer  Optimization  of  Singe-Source  Networks,"  Journal  of 
Environmental  Engineering  Division,  August  1978,  pp.  799-814. 

N.  Buras  and  Z.  Schweig,  "Aqueduct  Route  Optimization  by  Dynamic  Programming," 
Journal  of  the  Hydraulics  Division,  ASCE,  September  1969,  pp.  1615-1631. 

A.  Cenedese  and  P.  Mele,  "Optimal  Design  of  Water  Distribution  Networks,"  Journal  of 
the  Hydraulics  Division,  ASCE,  Vol.  104,  No.  HY2,  February  1978,  pp.  237-247. 

A.K.  Deb,  "Optimization  of  Water  Distribution  Network  Systems,"  Journal  of  the 
Environmental  Engineering  Division,  ASCE,  Vol.  102,  No.  EE4,  August  1976,  pp. 
837-851. 

A.K.  Deb,  "Optimization  in  Design  of  Pumping  Systems,"  Journal  of  the  Environmental 
Engineering  Division,  ASCE,  Vol.  104,  No.  EEl,  February  1978,  pp.  127-136. 

R. E.  Featherstone  and  K.K.  El-Jumaily,  "Optimal  Diameter  Selection  for  Pipe 

Networks,"  Journal  of  Hydraulic  Engineering,  ASCE,  Vol.  109,  No.  2,  February 
1983,  pp.  221-233. 

D. E.  Fleming,  R.K.  Hanson,  and  J.W.  Labadie,  "Integrated  Sizing  of  Water  Storage  and 

Conveyance,"  Jour,  of  Water  Resources  Planning  and  Management,  ASCE,  Vol. 
109,  No.  1,  January  1983,  pp.  94-111. 

I. C.  Goulter  and  D.R.  Morgan,  "Obtaining  the  Layout  of  Water  Distribution  Systems," 

Discussion;  January  1982,  by  William  F.  Rowell  and  J.  Wesley  Barnes  (Paper 
16817),  Jour,  of  the  Hydraulics  Division,  ASCE,  pp.  67-69. 

T.  Hashimoto,  D.P  Loucks,  and  J.R.  Stedubger,  "Reliability,  Resiliency,  and  Vulner¬ 
ability  Criteria  For  Water  Resource  System  Performance  Evaluation,"  Inter¬ 
national  Institute  for  Applied  Systems  Analysis,  Laxenberg,  Austria.  Water 
Resources  Research,  Vol.  18,  No.  1,  pp.  14-20,  February  1982. 

T.  Hashimoto,  D.P.  Loucks,  and  J.R.  Stedinger,  "Robustness  of  Water  Resources 
Systems,"  International  Institute  for  Applied  Systems  Analysis,  Laxenburg, 
Austria.  Water  Resources  Research,  Vol.  18,  No.  1,  pp.  21-26,  February  1982. 

S. L.S.  Jacoby,  "Design  of  Optimal  Hydraulic  Networks,"  Journal  of  the  Hydraulics 

Division,  ASCE,  Vol.  94,  No.  HY3,  May  1968,  pp.  641-661. 

E.  Kaily,  "Pipeline  Planning  Dynamic  Computer,"  Journal  AWWA,  pp.  114-118,  March 

1969. 


C.F.  Lam,  "Discrete  Gradient  Optimization  of  Water  Systems,"  Journal  of  _the 
Hydraulics  Division,  Vol.  99,  No.  HY6,  June  1973,  pp.  863-S72. 

G.E.  Quindry,  E.D.  Brill,  and  J.C.  Liebman,  "Optimization  of  Looped  Water  Dis¬ 
tribution  Systems,"  Journal  of  the  Environmental  Engineering  Division,  ASCE, 
August  1981,  pp.  665-679. 

G.E.  Quindry,  E.D.  Brill,  Jr.,  J.C.  Liebman,  and  A.R.  Robinson,  "Comment  onJDesign 
of  Optimal  Water  Distribution  Systems'  by  E.  Alperovits  and  U.  Shamir,"  Water 
Resources  Research,  Vol.  15,  No.  6,  December  1979,  pp.  1651-1655. 

W.F.  Rowell  and  J.W.  Barnes,  "Obtaining  Layout  of  Water  Distribution  Systems," 
Journal  of  the  Hydraulics  Division,  ASCE,  Vol.  108,  No.  HY1,  January  1982,  pp. 
137-148. 

U.  Shamir,  "Optimal  Design  and  Operation  of  Water  Distribution  Systems,"  Water 
Resources  Research,  Vol.  10,  No.  1,  February  1974,  pp.  27-36. 

U.  Shamir  and  C.D.D.  Howard,  "Water  Distribution  Systems  Analysis,"  Journal  of_the 
Hydraulics  Division,  ASCE,  Vol.  94,  No.  HY1,  January  1968,  pp.  219-234. 

T  M  Walski,  "Techniques  for  Calibrating  Network  Models,"  Journal  of  Water — R- 
esources  Plannine  and  Management,  ASCE,  Vol.  109,  No.  4,  October  1983,  pp. 
360-372. 

T.  Watanatada,  "Least-Cost  Design  of  Water  Distribution  Systems,"  Journal  of.., the 
Hydraulics  Division,  ASCE,  Vol.  99,  No.  HY9,  September  1973,  pp.  1497-1513. 


v*. 


a  j-w  . 


y. 


Water  Distribution  Reliability 

K.J.  Carl,  R.A.  Young,  and  G.C.  Anderson,  "Guidelines  for  Determining  Fire-Flow 
Requirements,"  Journal  AWWA,  May  1973,  pp.  335-344. 

R.M.  Clark,  C.L.  Stafford,  and  3. A.  Goodrich,  "Water  Distribution  Systems:  A  Spatial 
and  Cost  Evaluation,"  Journal  of  the  Water  Resources  Planning  and  Management, 
ASCE,  Vol.  108,  No.  WR3,  October  1982,  pp.  243-256. 

E.  Damelin,  U.  Shamir,  and  N.  Arad,  "Engineering  and  Economic  Evaluation  of  the 
Reliability  of  Water  Supply,"  Water  Resources  Research,  Vol.  8,  No.  4,  August 
1972,  pp.  861-877. 


L.S.  Lazarides,  "Head  Losses  in  Networks  by  a  Probabilistic  Approach,"  Hydraulic  and 
Sanitary  Consulting  Engineer,  Athens,  Greece,  24  pp. 

F.P.  Linaweaver,  Jr.,  and  C.S.  Clark,  "Costs  of  Water  Transmission,"  Journal  AWWA, 
December  1964,  pp.  1549-1560. 


K 


E.E.  Moyer,  et  al.,  "The  Economics  of  Peak  Detection  and  Repair — A  Case  Study," 
Journal  AWWA,  January  1983,  pp.  29-34. 

J.  Roller,  et  al.,  "Water  Systems  Operation,"  Journal  AWWA,  Plant  Operations,  March 
1973,  pp.  224-226. 

S.M.  Seward,  D.R.  Plane,  and  T.E.  Hendrick,  "Municipal  Resource  Allocation:  Mini¬ 
mizing  the  Cost  of  Fire  Protection,'1  Management  Science,  Vol.  24,  No.  16, 
December  1978,  pp.  1740-1748. 

U.  Shamir  and  C.D.D.  Howard,  "An  Analytic  Approach  to  Scheduling  Pipe  Re¬ 
placement,"  Journal  AWWA,  Management  and  Operations,  1979,  pp.  248-258. 

U.  Shamir  and  C.D.D.  Howard,  "Water  Supply  Reliability  Theory,"  Faculty  of  Civil 
Engineering,  Technion,  Haifa,  Israel.  President,  Charles  Howard  &  Associates, 
Ltd.,  Winnipeg,  Canada.  Abstract,  22pp. 


J.H.  Stacha,  "Criteria  for  Pipeline  Replacement,"  Journal  AWWA,  Management  and 
Operations,  1978,  pp.  256-258. 


T.M.  Walski,  "Economic  Analysis  of  Rehabilitation  of  Water  Mains,"  Journal  of  the 
Water  Resources  Planning  and  Management,  ASCE,  Vol.  108,  No.  WR3,  October 
1982,  pp.  296-308. 


T.M.  Walski  and  A.  Pelliccia,  "Economic  Analysis  of  Water  Main  Breaks,"  Journal 
AWWA,  Research  and  Technology,  March  1982,  pp.  140-147. 


J.B.  Wolff,  "Peak  Demands  in  Residential  Areas,"  Journal  AWWA.  October  1961.  do. 
1251-1260. 


B.C.  Yen,  "Uncertainties  on  Roughness  for  Pipe  Design,"  Department  of  Civil 
Engineering,  University  of  Illinois  at  Urbana-Champaign,  Illinois,  15pp. 


"Water  Distribution  Research  and  Applied  Development  Needs,"  Committee  Report, 
AWWA  Research,  June  1974,  pp.  385-390. 


rw..v-  -v  v v'c-.  '.'••.■•o 


A24 


Background 


1.  Although  a  great  deal  of  literature  has  been  published  on  network 
reliability  evaluation,  there  are  few  proven  computer  codes  available.  This 
appendix  presents  a  brief  discussion  of  several  computer  codes  that  have  been 
developed  for  risk  and  reliability  evaluation. 

2.  Henle/  and  Kumamoto  (1981)  identified  several  computer  codes  with 
application  to  risk  and  reliability  analysis: 

a.  Program  title:  KITT-IT 

Abstract:  Top-event  parameters  Qs  »  ws  *  f°r  systems 

using  kinetic  tree  theory  (augmented  KITT  program 
which  handles  storage  tanks  and  standby  redundancy) . 

Number  of  source  statements:  1,361 

User  manual:  24  pages 
_b.  Program  title;  PATH-CUT 

Abstract:  Conversion  of  minimal  path  sets  into  minimal  cut 

sets,  and  vice  versa,  using  a  classification  method. 

Number  of  source  statements:  250 

User  manual:  15  pages 
£.  Program  title:  PROTECT 

Abstract:  Based  on  a  fault  tree  for  a  plant  and  two  fault  trees 
for  a  protective  system,  the  computer  code  produces 
time  profiles  of  expected  numbers  of  normal  trips, 
spurious  trips,  and  destructive  hazards. 

Number  of  source  statements:  340 

User  manual:  75  pages 
ji.  Program  title:  PITE 

Abstract:  Simplifying  decision  tables  using  Quine's 

consensus  theory. 

Number  of  source  statements:  940 

User  manual:  200  pages 
e.  Program  title:  PRTME 

Abstract:  Generating  prime  implicants  for  noncoherent  fault 
trees  by  using  a  classification  method. 


Number  of  source  statements:  980 
User  manual:  40  pages 


f.  Program  title:  MARKOV 

Abstract:  This  code  calculates  time  profiles  of  state  probabil¬ 
ities  for  a  Markov  transition  diagram.  State  transi¬ 
tion  matrix  is  used  for  numerical  integration  of 
linear  differential  equations. 

Number  of  source  statements:  310 

User  manual:  15  pages 

g.  Program  title:  NLB  (New  Lawler  and  Bell) 

Abstract:  Redundancy  optimization  using  Lawler  and  Bell's 
integer  programming. 

Number  of  source  statements:  320 

User  manual:  15  pages 

h.  Program  title:  HEUR 

Abstract:  Reliability  optimization  under  constraints  on  cost, 
weight,  etc.,  by  using  a  heuristic  approach. 

Number  of  source  statements:  310 

User  manual:  20  pages 

i.  Program  title:  SCHE 

Abstract:  Conversion  of  reliability  block  diagrams  into  fault 
trees. 

Number  of  source  statements:  970 

User  manual:  40  pages 

j_.  Program  title:  CONVERSION 

Abstract:  Obtains  minimal  cut  sets  through  expansion  of  product 
of  sum  expression  of  top  event,  given  minimal  path 
sets . 

Number  of  source  statements:  220 

User  manual:  20  pages 

k.  Program  title:  FAMULS  (Fault  Tree  for  Multi-Loop  Systems) 

Abstract:  Generating  cut  sets  for  systems  with  multiple  control 
loops,  given  signal  flow  graph  representation  of 
plant . 

Number  of  source  statements:  620 

User  manual:  40  pages 

3.  The  following  eight  computer  programs  are  available  from  JBF  Associ¬ 
ates,  10700  Dutchtoun  Drive,  Knoxville,  Tennessee  37922.  Those  with  an 
asterisk  are  also  available  from  the  Argonne  National  Laboratory,  Code  Center, 
Argonne,  Illinois  60439. 


a.  *Program  title:  MOCUS 

Abstract:  Obtains  minimal  cut  sets  or  path  sets  for  fault  trees 
with  AND/OR  and  INHIBIT  logic. 

Minimum  core:  228K  (IBM) 

Number  of  source  statements:  1,800 

b.  *Program  title:  PREP 

Abstract:  Obtains  cut  sets  or  path  sets  from  fault  trees  with 
AND/OR  and  INHIBIT  logic  using  combinatorial  testing. 

Number  of  source  statements:  1,200 

c.  Program  title:  BACFIRE 

Abstract:  Aids  in  common  cause  failure  analysis  by  commonality 
searches . 

Number  of  source  statements:  800 

d.  *Program  title:  KITT-1 

Abstract:  For  calculating  top-event  parameters  Q  ,  W  ,  \ 

given  cut  sets  and  failure  and  repair  rates  for 
components.  Uses  kinetic  tree  theory. 

Number  of  source  statements:  1,800 

e.  *Program  title:  KITT-2 

Abstract:  A  version  of  KITT-1  which  permits  the  input  of  time- 
varying  failure  and  repair  rates. 

Number  of  source  statements:  1,700  ' 
f_.  *Program  title:  SAMPLE 

Abstract:  Uses  Monte  Carlo  techniques  to  obtain  confidence 

limits  for  top  events,  given  confidence  limits  for 
component  failure  and  repair  rates. 

Number  of  source  statements:  400 
£.  Program  title:  SUPERPOCUS 

Abstract:  A  simplified  KITT  which  uses  bounding  theorems  to 

approximate  top-event  probabilities.  Also  calculates 
Fusse 1 1-Vesely  important. 

Number  of  source  statements:  600 
h.  Program  title:  TREDA 

Abstract:  Draws  report-quality  fault  trees  using  a  CALCOMP 

plotter . 

Number  of  source  statements:  3,700 

4.  The  firm  Computer  Aids  lor  Design  Engineers  and  Scientists  (COADE) , 
8550  Katy  Freeway,  Suite  122,  Houston,  Texas  77024,  telephone  (713)  973-9060, 


sells  a  number  of  computer  codes  applicable  to  risk  and  reliability  analysis. 
The  computer  codes  sold  by  COADE  are  designed  to  run  under  the  IBM  PC  operat¬ 
ing  system.  These  programs  can  be  installed  on  microcomputers  or  mainframe 
computers.  A  list  of  the  available  programs  and  a  brief  description  are  given 
below: 

a.  DOWNTIME  (formerly  RELICS)  Systems  Reliability.  DOWNTIME  eval¬ 
uates  the  upper  and  lower  bounds  for  system  availability,  mean 
time  to  failure,  mean  time  to  repair,  as  well  as  component 
importance.  It  uses  reliability  block  diagrams  with  one  source 
and  one  sink  node.  Cut  sets  and  path  sets  are  computed  and 
displayed.  DOWNTIME  requires  data  on  component  mean  time  to 
failure,  mean  time  to  repair,  and  the  block  diagram  structure. 
DOWNTIME  can  handle  up  to  50  blocks. 

b.  MARKOV  Systems  Analysis  by  the  Markov  Method.  MARKOV  numeri¬ 
cally  integrates  a  set  of  differential  equations  to  give  a 
Markov  transition  diagram.  System  characteristics  such  as 
availability  and  reliability  can  be  output  in  response  to  user 
specifications.  The  program  automatically  generates  the  dif¬ 
ferential  equations  from  node-arc  relationships  of  the  transi¬ 
tion  diagram.  Applications  of  MARKOV  include: 

(1)  Common  cost  analysis. 

(2)  Standby  redundancy. 

(3)  Mission  reliability. 

(4)  Availability  analysis. 

(5)  Sequential  failure. 

(6)  Statistically  dependent  failures. 

Arcs  and  nodes  can  be  readily  deleted  or  added.  This  makes  the 
program  ideal  for  sensitivity  analysis.  Transition  rates  can 
also  be  easily  changed.  Up  to  50  nodes  can  be  handled  and 
100  characteristics  of  the  diagram  can  be  output. 

c.  BACFIRE  Common  Cause  Analysis.  A  common  cause  failure  analysis 
is  an  essential  part  of  complete  system  safety  and  reliability 
analysis.  BACFIRE  searches  for  common  potential  causes  of  fail¬ 
ure  among  the  basic  events  of  a  minimal  cut  set  of  the  system 
logic  model.  A  minimal  cut  set  with  a  common  potential  cause  of 
failure  is  called  a  common  cause  candidate.  When  a  common  cause 
candidate  is  identified,  it  is  listed  with  its  common  potential 
cause  of  failure  as  output  from  BACFIRE. 

d.  SUPERPOCUS  Systems  Analysis  from  Cutsets.  SUPERPOCUS  is  used 
during  quantitative  reliability,  safety,  and  risk  evaluations. 
Civen  minimal  cut  sets  and  mean  times  to  failures  of  system  com¬ 
ponents,  SUPERPOCUS  will  calculate  system  probabilistic  reli¬ 
ability  and  safety  characteristics,  employing  tightly  bounded 
approximation  methods.  Results  include  time-dependent  reliabil¬ 
ity  characteristics  such  as  unavailability,  reliability,  and 
expected  number  of  failures  in  the  system.  Data  can  be 


passed  to  Sl'PERPOCUS  from  the  qualitative  reliability  programs 
MOCUS,  PATH-CUT,  and  DOWNTIME. 

MOCUS  Cutsets  and  Path  Sets  for  Fault  Trees.  From  a  given 
graphical  representation  of  a  fault  tree  (Boolean  logic  fail¬ 
ure),  the  system  failure  modes  (called  the  minimal  cut  sets)  and 
the  system  success  modes  (called  the  minimal  path  sets)  can  be 
determined  by  MOCUS.  Input  can  be  control  parameters  specifying 

(1)  the  minimal  cut  set  length  to  be  obtained. 

(2)  the  type  of  minimal  sets  to  be  obtained. 

(3)  the  output  options. 

A  description  of  the  fault  tree  is  also  needed.  MOCUS  is  a 
thoroughly  tested,  widely  used  program. 

SCHE  Block  Diagrams  to  Fault  Trees.  Given  a  process  block 
(reliability)  diagram,  SCHE  will  generate  a  fault  tree,  the  top 
event  being  the  failure  of  all  system  paths.  The  fault  tree 
produced  is  a  true,  sequential  representation  of  possible  modes 
of  system  failure.  The  program  is  useful  for  sensitivity  anal¬ 
yses  insofar  as  the  blocks  can  be  changed,  deleted,  or  added. 
Output  nodes  can  be  moved  and  manipulated.  SCHE  generates  fault 
trees  for  block  diagrams  having  up  to  50  nodes. 

SAMPLE  Confidence  Limits  of  Top  Events.  SAMPLE  uses  the  Monte 
Carlo  sampling  technique  to  obtain  the  mean,  standard  deviation, 
probability  range,  and  distribution  for  a  multivariable  func¬ 
tion.  The  independent  variables  are,  for  example,  component 
reliabilities  being  subject  to  statistical  variations  and  the 
dependent  variable  such  as  the  system  reliability.  Given  values 
for  the  location  and  dispersion  parameters  of  the  independent 
variables,  and  a  specific  input  distribution,  SAMPLE  obtains  a 
Monte  Carlo  sample  of  the  independent  variables  and  evaluates 
the  corresponding  function.  The  program  has  three  output  dis¬ 
tributions:  normal,  lognormal,  and  log-uniform.  SAMPLE  is  very 
useful  because  system  characteristics  such  as  reliability  and 
availability  can  be  evaluated  as  a  range  an d  not  a  point 
est imate . 

PATH  CUT  Minimal  Path  to  Minimal  Cut  Sets.  PATH  CUT  converts 
minimal  path  cuts  into  minimal  cut  sets  using  a  classification 
method.  This  is  a  top-down  approach  which  converts  the  smaller 
number  of  path  sets  (which  i  an  he  1 ouiui  hv  inspection  of  reli¬ 
ability  block  diagrams ,  or  hv  MOCUS  or  RIG  ICS)  into  cut  sets 
without  a  pairwise  comparison  or  redundant  sets.  Hence ,  PATH 
CUT  is  taster  and  im  re  eit  icient  than  MoC  S-hase-1  algorithms.  A 
path  set  can  r.e  milled  or  evicted  <mm  lv,  f ;»«  i  1  itat  ing  sensitiv¬ 
ity  analyses.  PATH  <G  T  an  conceit  .c  r>anv  as  ru>  path  sets  with 
a  maximum  of  hi'  element  :  t  minim..!  »  ut  sets.  Cut  sets  are 
output  one  by  one  as  thr\  are  t  t  :o  i,  and  thus  the  program 
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components  to  be  allocated  to  each  block  while  maximizing  the 
system  reliability  and  meeting  with  constraints  such  as  cost, 
weight,  and  volume.  HEUR1  solves  the  redundancy  optimization 
problem  by  a  heuristic  integer  programming  technique  which  is 
based  on  a  steepest  ascent  in  one  neighborhood.  Each  local 
optimum  reached  is  checked  for  optimality  in  two  neighborhoods. 
Constraints  can  be  relaxed,  narrowed,  deleted,  or  added,  thus 
facilitating  sensitivity  analysis.  Up  to  50  blocks  can  be 
handled. 

HEUR2  Redundancy  Allocation.  HEUR2  is  an  advanced  version  of 
HEUR1.  Given  a  reliability  block  diagram,  the  program  will 
yield  the  optimal  number  of  redundant  components  to  be  allocated 
to  each  block,  while  maximizing  the  system  reliability  and  meet¬ 
ing  with  constraints  such  as  cost,  weight,  and  volume.  HEUR2 
uses  a  steepest  ascent  in  two  neighborhoods  rather  than  an 
ascent  in  one  neighbor  as  in  HEUR1.  Constraints  can  be  relaxed, 
narrowed,  deleted,  or  added.  HEUR2  will  generate  a  solution 
close  to  the  global  optimum,  starting  from  trivial  initial 
guesses.  Up  to  40  blocks  can  be  handled. 

CONVERSION  Minimal  Paths  to  Minimal  Cut  Sets.  Reliability  block 
diagrams  and  fault  trees  usually  have  a  smaller  number  of  path 
sets  than  cut  sets.  The  minimal  path  sets  can  be  found  easily 
by  inspection  or  by  use  of  MOCUS  or  RELICS.  Given  a  number  of 
path  sets,  CONVERSION  inverts  them  into  minimum  cut  sets  by 
expansion  of  a  product  of  a  sum  expression  of  the  top  events 
with  absorption  of  redundant  terms.  A  path  set  can  be  added  or 
deleted,  facilitating  sensitivity  analysis.  CONVERSION  can 
handle  as  many  as  50  paths  with  a  maximum  of  50  elements.  It 
generates  as  many  as  800  minimal  cut  sets  with  a  maximum  size 
of  11. 


